WO 01/23604 



PCT/CA00/01150 



Annex XLVTI: Molecular beacon internal hybridization probes for 
specific detection of tuf sequences* 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence 3 SEQ ID 



Nucleotide 
NO. position 



Bacterial species ; Chlainydia pneumonias 

2091 5 ' -CQC QM1 TTG AGA TGG AAC TTA GTG AGC 20 157-183 

TTC TTG GTC GCG 

2092 5 ' -CGC GAC GAA AGA ACT TCC TGA AGG TCG 20 491-516 

TGC AG£ T££ AQ 



Bacterial 


st>ecies : 


2213 


5 ' -CGT GCC 




GAC GCT 


Bacterial 


.species: 


1236 


5 ' -GCG AGC 




GGC TCG 


Bacterial 


species : 


1235 


5 ' -GCG AGC 




TGC TG£ 


Bacterial 


snecies: 


2084 c 


5 ' -CAC GCG 




TTT TGC 


Bacterial 


species: 


2096 c 


5 ' -CGC GAC 




TGT CGC 


Bacterial 


species : 



Chlamydia trachomatis 

SAC ATG ATT TCC GAA GAA 
SGC ACG 

Enterococcus faecalis 

3GT GAA GTT CGC GTT GGT 

Enterococcue faecium 

IGT TGA AGT TGT TGG TAT 

Legionella pneumophila 

LCA CCC GTA CAA GTC GTC 

Mycoplasma pneumoniae 

'AC CAC GGC CAG TAA TCG 

Neisseria gonorrhoeae 



1739 b 412-441 



883 370-391 



64 412-437 



112 461-486 



2097 b 658-679 



2177 5 ' -GGC ACG SAC AAA CCA TTC CTG CTG CCT 126 323-357 

ATC GAA ACG TGT TC£ CGT GCC 

2178 5 ' -GGC &CG ACA AAC CAT TCC TGC TGC CTA 126 323-348 

TCG AAC GTG ££ 

2179 5'-G£Q A££ TCT ACT TCC GTA CCA CTG ACG 126 692-718 

TAA CCG GCT GCC 

a Underlined nucleotides indicate the molecular beacon's stem. 
b Sequence from databases. 

c This sequence is from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex XLVXXs Molecular beacon internal hybridization probes for 
specific detection of tuf sequenc s (continued) . 

Originating DNA fragment 



SEQ ID NO. Nucleotide sequence 3 SEQ ID Nucleotide 

NO. position 

Bacterial species : Pseudomonas aeruginosa 

2122 5'-QQG &Q£ GAA TGT AGG AGT CCA GGG TCT 153 , 880, 2138 b * c 280-302 d 

CTQ CTC GG 

Bacterial species : Staphylococcus aureus 

2186 5 £££ TCA AAG CAG AAG TAT ACG TAT 1728 615-646 

TAT CAA AAG AC£ CGC GT 



1233 


5' -Q£Q 
CGQ 


Funaal 


st>ecies : 


2073 


5 ' -CCG 
AAC 


Fungal 


..species: 


2074 


5 ' -CCG 
AAC 


Funaal 


siDecies : 


2110 b 


5 ' -GCG 
TGG 


2111 


5 ' -GCG 
CTT 


Funaal 


soecies: 



Staphylococcus sp. other than S. aureus 

£T GGT GTA GAA ATG TTC 878 372-394 

Candida albicans 



Candida dubliniensis 



Candida glabrata 



408 404-429 
414 416-441 

AA CGA TTT CAG CGA ATC 417 307-335 
CC GC 

TG AAG CCA CCA CCA ACG 417 419-447 
CG C 

Candida krusei 

2112 b 5 ' -GCG £££ TTG ATG AAG TTT GGG TTT CCT 422 318-347 
TGA CAA TTG CCC GC 

2113 5 ' -GCG GGC ACA AGG GTT GGA CTA AGG AAA 422 419-447 

CCA AGG CAfi £££ S£ 

2114 5 ' -GCG -GGC ATC GAT GCT ATT GAA CCA CCT 422 505-533 

GTC AGA CCfi £££ 



a Underlined nucleotides indicate the molecular beacon's stem. 
b Sequence from databases. 

c These sequences were aligned to derive the corresponding primer. 

d ^ e «S UC ^??? ide P° sifcions refer to the p. aeruginosa tuf sequence fragment (SEQ 
ID NO. 153 ) . 
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Annex XLVII: Molecular beacon internal hybridization probes for 
specific det ction of tuf sequences (continu d) . 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence 3 



SEQ ID 
NO. 



Nucleotide 
position 



Fungal species ; 



Candida lusitaniae 



2115 b 

2116 

2117 



5'-G£G QQC GGT AAG TCC ACC GGT AAG ACC 
TTG TTG GCC CGC 

5'-£££ GQQ GTA AGT CAC CGG TAA GAC CTT 
GTT GGC CCG £ 

5 ' -GCG GGC GAC GCC ATT GAG CCA CCT TCG 
AGA £££ CGC 



Fungal species 



Candida, parapsilosis 



2118 b 



5' -G£Q QQQ TCC TTG ACA ATT TCT TCG TAT 
CTG TTC TTG GCC CGC 



Funaal species : 



Candida tropicalis 



2119 
2120 



5'-£CG QSQ TTA CAA CCC TAA GGC TGT TCC 
ATT CGT TGC CCG £ 

5'-G££ £££ AGA AAC CAA GGC TGG TAA GGT 
TAC CGG A££ CCG £ 



424 
424 
424 

426 

429 
429 



304-330 
476-502 
512-535 

301-330 

357-384 
459-487 



Fungal gpficie*: 



Cxyptococcus neoformans 



2106 5 '"Sc GC AGA GCA CGC CCT C " CGC C5C- 623,1985,1986^ 226-244* 

2107 5 ^gg ^ ^ CCA TCT CTG GTT GGC A££ 623 , 1985 , 1986<= 390-408* 



Bacter ial genua 



Legionella sp. 



2083 



5 '-££5 £££ ATG TTC CGT AAA TTA CTT GAI 
GAA GGT CGA GCC GGC QQ 



111-112° 



488-519 e 



Underlined nucleotides indicate the molecular beacon's stem. 

This sequence is from the complementary DNA strand o<f *-v.<* c^,^ * ^ 

c originating fragment given in the Seju^e Liitfn? the seguence of the 

* These sequences were aligned to derive the corresponding primer 

^U^^^lD^^ff. t0 thS C ' tuf (EF-1) sequence 

6 ^g^tT^ZD^lSu refer tC ^ L - *™™* hil * (EF-1) sequence 
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Annex XI.VTI: Molecular beacon internal hybridization probes for 
specific detection of tuf sequences (continued) . 



SEQ ID NO. Nucleotide sequence 3 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



Funaal genus ; 



Candida. sp. 



2108 5'-GC£ GGC AAC TTC RTC AAG AAG GTT GGT 

TAC AAC CCG CCC GC 



2109 5 l -Q££ Q£Q CCA ATC TCT GGT TGG AAY GGT 

GAC AA£ CCC GC 



Bacterial group: 



Pseudomonads 



2121 5'-£G& ££S CIA GCC GCA CAC CAA GTT CQQ 

GTC Q 



414,417, 
422,424, 
426,429,624 b 

Same as SEQ 
ID NO. 2108 



52-80° 



100-125 c 



153-155, 598-616 e 
205,880,2137 d , 



* Underlined nucleotides indicate the molecular beacon's stem. 
These sequences were aligned to derive the corresponding primer. 

° TsEQ n ro 1 NO? i 624?? Siti ° nS t0 ^ ° 9 albicans tuf sequence fragment 

d Sequence from databases. 

e S e iro UC i53f ide pOSitions refer to the p - aeruginosa tuf sequence fragment (SEQ 
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Annex XLVIII: Molecular beacon internal hybridization probes for 
specific detection of ddl and mfcl gene sequences. 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence* ' SEQ ID Nucleotide 

NO. position 

Bacterial species; e. faealum (ddl) 

1237 5'-QQS AQ£ CGC GAA ATC GAA GTT GCT GTA 1242 b 334-^9 

TTA GGQ £T£ G£ 

Bacterial species: faecalls (mtl) 

1238 5'-Q£Q GGC GTT AAT TTT GGC ACC GAA 1243 b 

GAA GAS ££Q 



* Underlined nucleotides indicate the molecular beacon's stem 
Sequence from databases. 
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Annex XLXXs Internal hybridization probe for specific detection 
of S. aureus sequences of unknown coding potential. 

Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Bacterial species: Staphylococcus aureus 

1234 5 ' -ACT AAA TAA ACG CTC ATT CG 1244 
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Annex L: Specific and ubiquitous primers for nucl ic acid 
amplification (antimicrobial agents resistance genes 
sequences) • 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



Resistance g ene: 



aac (2')-Xa. 



1344 
1345 b 

1346 
1347 b 



5' -AGC AGC AAC GAT GTT ACG CAG CAG 
5'-CCC GCC GAG CAT TTC AAC TAT TG 



5' -GAT GTT ACG CAG CAG GGC AGT C 
5' -ACC AAG CAG GTT CGC AGT CAA GTA 



Resist ance aene : 



aac f3'J-Xb 



1349 
1350 b 



5' -CAG CCG ACC AAT GAG TAT CTT GCC 
5' -TAA TCA GGG CAG TTG CGA CTC CTA 



Resistance gene : 



aac(3')-2Xh 



1352 
1353 b 

1354 
1355 b 



5 '-CCA CGC TGA CAG AGC CGC ACC G 
5 ' -GGC CAG CTC CCA TCG GAC CCT G 

5'-CAC GCT GAC AGA GCC GCA CCG 
5'-ATG CCG TTG CTG TCG AAA TCC TCG 



Resistance aene: 



aac (3') -XVa 



1357 
1358 b 

1359 
1360 b 



5' -GCC CAT CCA TTT GCC TTT GC 
5'-GCG TAC CAA CTT GCC ATC CTG AAG 

5' -TGC CCC TGC CAC CTC ACT C 
5'-CGT ACC AAC TTG CCA TCC TGA AGA 



1348 a 
1348 a 

1348 a 
1348 a 



1351 a 
1351 a 



1356 a 
1356 a 

1356 a 
1356 a 



1361 a 
1361 a 

1361 a 
1361 a 



163-186 
392-414 

172-193 
467-490 



178-201 
356-379 



383- 404 
585-606 

384- 404 
606-629 



295-314 
517-540 

356-374 
516-539 



Resistance aene : 



aac(3')-vxa 



1362 
1363 b 

1364 
1365 b 



5' -CGC CGC CAT CGC CCA AAG CTG G 
5' -CGG CAT AAT GGA GCG CGG TGA CTG 

5' -TTT CTC GCC CAC GCA GGA AAA ATC 
5 '-CAT CCT CGA CGA ATA TGC CGC G 



Resistance aene: 



aac(6')-ia 



1367 
1368 b 

1379 
1380 b 



5' -CAA ATA TAC TAA CAG AAG CGT TCA 
5'-AGG ATC TTG CCA ATA CCT TTA T 

5 ' -AAA CCT TTG TTT CGG TCT GCT AAT 
5 ' -AAG CGA TTC CAA TAA TAC CTT GCT 



1366 a 
1366 a 

1366 a 
13 66 a 



1371 a 
1371 a 

1371 a 
1371 a 



285-306 
551-574 

502-525 
681-702 



■ 56-79 
269-290 

153-176 
320-343 



a Sequence from databases . 

b These sequences are from the complementary DNA strand 
originating fragment given in the Sequence Listing. 
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X LS SSJJJ 0 -*^™*' ^Witous primers for nucleic acid 
amplification (antimicrobial agents resistance genes 
sequences) (continued) . genes 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



Resistance 


aene: 


1372 


5 ' -GCT 


1373 b 


5 ' -CAC 


1374 


5 ' -AGA 


1375 b 


5 ' -CCC 


Resistance 


..qene: 


1377 


5 ' -GCC 


1378 b 


5 ' -GCT 


1379 


5 ' -AGC 


1380 b 


5 ' -CGC 


Resistance 


aener 


1382 


5 ' -TAG 


1383 b 


5 ' -CCC 


1384 


5' -GAT 


1385 b 


5 ' -TCC 


Resistance 


oene : 


♦ 1387 


5'-TTA ' 


1338 b 


5'-TAC I 



aac(6')-lc 



ant (3')-Ta 



ant(4')-Xa 



--*? 89 _ 5 L-G^A TAA CGG TTT ^GGT TGA TGC GAG 

1390 b 5'-ATG GCA AGA TCC TGG TAT CGG TCT 

Resistance gene; *ph(3')-IXa 



1392 
1393 b 

1394 
1395 b 



5 ' -TGG GTG GAG AGG CTA TTC GGC TAT 
5'-CAG TCC CTT CCC GCT TCA GTG AC 

5'-GAC GTT GTC ACT GAA GCG GGA AGG 
5' -CTT GGT GGT CGA ATG GGC AGG TAG 



SEQ ID 
NO. 



1376 a 
1376 a 

1376 a 
1376 a 



1381 a 
1381 a 

1381 a 
1381 a 



1386 a 
1386 a 

1386 a 
1386 a 



1391 a 
1391 a 

_1391 a 
1391 a 



13 96 a 
1396 a 

1396 a 
1396 a 



Nucleotide 
position 



157-180 
304-326 

104-127 
301-323 



100-123 

221- 244 

127-150 

222- 245 



149-172 
386-408 

151-172 
388-409 



233-256 
488-511 

468-491 
669-692 



43-66 
250-272 

244-267 
386-409 



a Sequence from databases. 

b These sequences are from the complementary DNA strand 
originating fragment given in the Sequence Listing. 



of the sequence of the 
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Annex x»i Specific and ubiquitous narimers - 

amplification (antimicrobial "^ents residence* c*™ 
sequences) (continued) . resistance g nes 



SEQ ID NO. Nucleotide sequence 

apA<\3';-xxxa 



Resistanrp 


oene : 


1397 


5 ' -GTG 


1398 b 


5 ' -ATG 


1399 


5 ' -ACC 


1400 b 


5 ' -CGA 


Resistanrp 


aener 


1402 


5 ' -TAT 


1403 b 


5 ' -TCA 



aph(3')-VXa 



1404 
1405 b 



5 ' -AAA CAG CGT TTT AGA GCC AAA TAA 
5'-TTC TCA GAG AGC CAA CTC AAC ATT 



Resistanrp 


aene: 


1407 


5 ' -CCC 


1408 b 


5 ' -TTG 


1409 


5 ' -TGG 


1410 b 


5 ' -GGG 


Resistanrp 


aene: 


1412 


5 1 -GAG 


1413 b 


5 ' -CAT 


1414 


5 ' -TAT 


1415 b 


5 ' -ATG 



blaCARB 



blaCMY-2 



Resistance g^noc . 



bla.CTX-M-1, blaCTX-M-2 



1417 
1418 b 



5' -TGG TTA ACT AYA ATC CSA TTG CGG A 
5' -ATG CTT TAC CCA GCG TCA GAT T 
Resist ance g pn^. 



blaCTX-M-1 



1419 
1420 b 

1421 
1422 b 



5' -CGA TGA ATA AGC TGA TTT CTC ACG 
5'-TGC TTT ACC CAG CGT CAG ATT ACG 

5'-AAT TAG AGC GGC AGT CGG GAG GAA 
5'-GAA ATC AGC TTA TTC ATC GCC ACG 



Originating DNA fragment 



SEQ id 
NO. 



1401 a 
1401 a 

1401 a 
1401 a 



1406 a 
1406 a 

1406 a 
1406 a 



1411 a 
1411 a 

1411 a 
1411 a 



1416 a 
1416 a 

1416 a 
1416 a 



1423 a 
1423 a 



1423 a 
1423 a 

1423 a 
1423 a 



Nucleotide 
position 



103-124 
355-374 

160-183 
354-376 



18-41 
175-197 

36-59 
177-200 



351-374 
556-577 

205-227 
329-353 



793-813 
975-995 

90-110 
439-459 



314-338 
583-604 



410-433 
580-603 

116-139 
405-428 



a Sequence from databases. ~ 
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Annex L: Specific and 



, - * . ^ ubiquitous primers for nucleic acid 

amplification (antimicrobial agents r sistance 
sequences) (continued). e 



genes 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



Resist ance g pnp 



1424 


5 ' 


1425 b 


5' 


1426 


5' 


1427 b 


5' 


Resistanrp 


oene : 


1429 


5'< 


1430 b 




1431 


5'- 


1432 b 


5'- 


Resistance 


aene : 


1434 


5'- 


1435 b 


5'- 


Resistance 




1436 


5'- 


1437 b 


5'- 


Resistance 


crene.: 


1440 


5 ' - 



blaCTX-M-2 

-GTT AAC GGT GAT GGC GAC GCT AC 
-GAA TTA TCG GCG GTG TTA ATC AGC 



blaXMP 

-TTT ACG GCT AAA GAT ACT GAA AAG T 



blaOXA2 



blaOXAlO 



blaPER-1 



144lt> 5 ' -TTT TTT GCT CAA ACT TTT TCA GGA TC 

Resist ance aene : blaPER-2 



1443 
1444 b 



5'-CTT CTG CTC TGC TGA TGC TTG GC 
5' -GGC GAC CAG GTA TTT TGT AAT ACT GC 

Resistance <?enes : 



blaPER-1, blaPER-2 



1446 
1447 b 

1448 
1449 b 



5 ' -GGC CTG VGA TTT GTT ATT TGA ACT GGT 
5'-CGC TST GGT CCT GTG GTG GTT TC 

5' -GAT CAG GTG CAR TAT CAA AAC TGG AC 
5 ' -AGC WGG TAA CAA YCC TTT TAA CCG CT 



a Sequence from databases. — - ™ 

These sequences are from the complementary DNA strand 
origxnating fragment given in the Sequence Listing. 
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1428 a 
1428 a 

1428 a 
1428 a 



1433 a 
1433 a 

1433 a 
1433 a 



1438 a 
1438 a 



1439 a 
1439 a 



1442 a 
1442 a 



1445 a 
1445 a 



1442 a 
1442 a 

1442 a 
1442 a 



30-52 
153-176 

510-531 
687-710 



205-229 
513-538 

191-215 
497-522 



319-343 
532-556 



194-217 
479-504 



281-306 
579-604 



32-54 
304-329 



414-440 
652-674 

532-557 
671-696 
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Annex L: Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents resistance genes 
sequences) (continued). 9 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ id 
NO. 



Nucleotide 
position 



1883 


5 ' -AGC 


1884 b 


5 ' -GTA 


1885 


5 ' -AGC 


1 rra!> 


5 ' —GTA 


Resistance 


aene : 


1906 


5 ' -CCT 


1907 b 


5 ' -CAC 


1908 


5 ' -AAC 


1907 b 


5 ' -CAC 


Resistance. 


aene: 


2145 


5 ' -GCA 


2146 b 


5 ' -GGG 


Resistance 


aene: 


2148 


5 ' -CAG 


2149 b 


5 ' -ATC 


Resistance 


gene: 



hlaSHV 



blaTEM 



cat J 



catll 



cat XII 



2151 
2152 b 



Resistance g ene: 



5 ' -ATA TTT CAG CAT TAC CTT GGG TT 
5'-TAC ACA ACT CTT GTA GCC GAT TA 

catP 



2154 
2155 b 



5 ' -CGC CAT TCA GAG TTT AGG AC 
5'-TTC CAT ACC GTT GCG TAT CAC TT 



Resistance aene ? 



cat 



2157 
2158 b 

2160 
2161 b 



5 ' -CCA CAG AAA TTG ATA TTA GTG TTT TAT 
5 ' -TCG CTA TTG TAA CCA GTT CTA 

5 ' -TTT TGA ACA CTA TTT TAA CCA GC 
5' -GAT TTA ACT TAT CCC AAT AAC CT 



1900Q 
1900 a 

1900 a 
1900 a 



1927 a 
1927 a 

1927 a 
1927 a 



2147 a 
2147 a 



2150 a 
2150 a 



2153 a 
2153 a 



2156 a 
2156 a 



2159 a 
2159 a 

2162 a 
2162 a 



71-93 
763-785 

313-333 
763-785 



27-46 
817-838 

148-170 
817-838 



363-384 
484-506 



67-88 
151-173 



419-441 
603-625 



178-197 
339-361 



89-115 
201-221 

48-70 
231-253 



a Sequence from databases. ~ - - 

b These sequences are from the complementary DNA strand 
originating fragment given in the Sequence Listing. 
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Annex L: 



Specific and 
amplification 



ubiquitous 



r 4 • ^. Primers for nucl ic acid 
(antimicrobial agents resistanc 
seguenc s) (continued) • ™ 



genes 



SEQ ID NO. 



Nucleotide sequence 







1450 


5'- 


1451 b 


5'- 


Resistance 


aene: 


1457 


5' - 


1458 b 


5'- 


1459 


5 


1560 b 


5'- 


Resistant 


genes 


1453 




1454 b 


5'-< 



1455 
1456 b 



Resistanrp 


aene: 


1466 


5' -AAG 


1467 b 


5 ' -AAT 


1468 


5 ' -CGT 


1469 b 


5 ' -CAT 


Resistance 


_,qen<* : 



dfrA 

-ACC ACT GGG AAT ACA CTT GTA ATG GC 
-ATC TAC CTG GTC AAT CAT TGC TTC GT 

cLhfrXa 

•CAA AGG TGA ACA GCT CCT GTT T 
TCC GTT ATT TTC TTT AGG TTG GTT AA 

AAG GTG AAC AGC TCC TGT TT 
GAT CAC TAC GTT CTC ATT GTC A 

: dhfrla, dhfrXV 

ATC GAA GAA TGG AGT TAT CGG RAA TG 
CCT AAA AYT RCT GGG GAT TTC WGG A 

5 ' -CAG GTG GTG GGG AGA TAT ACA AAA 

5 ' -TAT GTT AGA SRC GAA GTC TTG GKT AA 

dhfttb 



dhfrV 



1471 
1472 b 



5' -GAT AAT GAC AAC GTA ATA GTA TTC CC 
5 ' -GCT CAA TAT CAA TCG TCG ATA TA 

5--TTA AAG CCT TGA CGT ACA ACC AGT GG 
5 ' -TGG GCA ATG TTT CTC TGT AAA TCT CC 
Resist ance g^n^g . 



1473 
1474 b 



1462 
1463 b 

1464 
1465 b 



dlifrlb, dlifrV 



5' -GCA CTC CCY AAT AGG AAA TAC GC 
5' -AGT GTT GCT CAA AAA CAA CTT CG 

5'-ACG TTY GAA TCT ATG GGM GCA CT 
5 ' -GTC GAT AAG TGG AGC GTA GAG GC 



Originating dna fragment 



SEQ ID 
NO. 



Nucleotide 
position 



1452 a 
1452 a 



1461 a 
1461 a 

1461 a 
1461 a 



1461 a 
1461 a 

1461 a 
1461 a 



1470 a 
1470 a 

1470 a 
1470 a 



1475- 
1475 a 

1475 a 
1475 a 



1470 a 
1470 a 

1470 a 
1470 a 



106-131 
296-321 



75-96 
249-275 

77-96 
207-228 



27-52 
384-408 

290-313 
416-441 



98-120 
204-230 

183-208 
354-376 



208-233 
342-364 

95-120 
300-325 



157-179 
405-427 

139-161 
328-350 



* Sequence from databases. 

These sequences axe from the complementary DNA . ^ , 

originating fragment given in th^slSen^L^ing sequence of the 

332 



WO 01/23604 



PCT/CA00/01150 



sequences) (continued). g ts resistance genes 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



Besistance cj<=>^<=> 



dhfrVX 



1476 
1477 b 

1478 
1479 b 



5' -GGC GAG CAG CTC CTA TTC AAA G 

5 '-TAG GTA AGC TAA TGC CGA TTC AAC A 

5 '-GAG AAT GGA GTA ATT GGC TCT GGA TT 
5' -GCG AAA TAC ACA ACA TCA GGG TCA T 



dhfrVXX 



1485 
1486 b 

1487 
1488 b 



5 * -AAA ATG GCG TAA TCG GTA ATG GC 

5 '-CAT TTG AGC TTG AAA TTC CTT TCC TC 

5' -AAT CGA AAA TAT GCA GTA GTG TCG AG 
5'-AGA CTA TTG TAG ATT TGA CCG CCA 



Besistance cr^-n^^ . 



dhfrVXX, dhfrXVII 



1481 
1482 b 

1483 
1484 b 



5'-RTT ACA GAT CAT KTA TAT GTC TCT 
5'-TAA TTT ATA TTA GAC AWA AAA AAC TG 

5 ' -CAR YGT CAG AAA ATG GCG TAA TC 

5 ' -TKC AAA GCR WTT TCT ATT GAA GGA AA 



Resistance geng: 



dhfrVXXX 



1490 
1491 b 

1492 



5' -GAC CTA TGA GAG CTT GCC CGT CAA A 
5 ' -TCG CCT TCG TAC AGT CGC TTA ACA AA 

5 . -CAT TTT AGC TGC CAC CGC CAA TGG TT 
5'-GCG TCG CTG ACG TTG TTC ACG AAG A 



Resistance ry^no . 



dhfrXX 



1495 
1496 b 

1497 
1498 b 



5' -TCT CTA AAC ATG ATT GTC GCT GTC 
5' -CAG TGA GGC AAA AGT TTT TCT ACC 

5'-CGG ACG ACT TCA TGT GGT AGT CAG T 
5'-TTT GTT TTC AGT AAT GGT CGG GAC CT 



* Sequence from databases. 



SEQ ID 
NO. 



1480 a 
1480 a 

1480 a 
1480 a 



1489 a 
1489 a 

1489 a 
1489 a 



1489 a 
1489 a 

1489 a 
1489 a 



1494 a 
1494 a 

1494 a 
1494 s 



1499a 
1499a 

1499a 
1499a 



Nucleotide 
position 



79-100 
237-261 

31-56 
209-233 



32-54 
189-214 

166-191 
294-317 



268-291 
421-446 

23-45 
229-254 



144-168 
376-401 

18-43 
245-269 



7-30 
133-156 

171-195 
446-471 



of the sequence of the 
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SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO- 



Nucleotide 
position 



1500 


5' 


1501 b 


5' 


1502 


5' 


1503 b 




ResistanrP 


_3ene: 


1505 


5'- 


1506 h 


5'- 


Resistanrp 


oene: 



1508 


5' 


1509 b 


5 ' 


1510 


5' 


1511 b 


5' 


Resistanrp 


aene: 


1513 


5' 


1514 b 


5' 



1515 
1516 b 



5' 
5' 



^w_^ w aaafeAhs - 



dhfrXXX 

-ATC GGG TTA TTG GCA ATG GTC CTA 
-GCG GTA GTT AGC TTG GCG TGA GAT T 

-GCG GGC GGA GCT GAG ATA TAC A 
-AAC GGA GTG GGT GTA CGG AAT TAC AG 

dhfrXXXX 

-ATT TTT CGC AGG CTC ACC GAG AGC 
•CGG ATG AGA CAA CCT CGA ATT CTG CTG 

dhfrXV 

-AGA ATG TAT TGG TAT TTC CAT CTA TCG 
-CAA TGT CGA TTG TTG AAA TAT GTA AA 

-TGG AGT GCC AAA GGG GAA CAA T 
-CAG ACA CAA TCA CAT GAT CCG TTA TCG 

dhfrXVXX 

-TTC AAG CTC AAA TGA AAA CGT CC 
-GAA ATT CTC AGG CAT TAT AGG GAA T 

-GTG GTC AGT AAA AGG TGA GCA AC 
-TCT TTC AAA GCA TTT TCT ATT GAA GG 



1504 a 
1504 a 

1504 a 
1504 a 



1507 a 
1507 a 



1512 a 
1512 a 

1512 a 
1512 a 



1517 a 
1517 a 

1517 a 
1517 a 



50-73 
201-225 

304-325 
452-477 



106-129 
413-439 



215-241 
336-361 

67-88 
266-292 



201-223 
381-405 

66-88 
232-257 



2102 
2103 b 



5'-CAC CTT CAC CCT GAC CGA CG 
5 ' -CGA ACC AGC GGA AAT AGT TGG AC 

Resistance cr^n^- 



1528 
1529 b 



ejreA, ereA2 



5'-AAC TTG AGC GAT TTT CGG ATA CCC TG 
5' -TTG CCG ATG AAA TAA CCG CCG ACT 



2105 a 
2105 a 



1530 a 
1530 a 



822-841 
948-970 



80-105 
317-340 



a Sequence from databases. 
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Annex L: Specific and 



, ubxquxtous primers for nucleic acid 

amplxfication (antimicrobial agents resistance 
sequences) (continued). 



genes 



SEQ ID NO. 



Nucleotide sequence 



Resistance 


aene 


1531 


5' 


1532 b 


5' 


1533 


5' 


1534 b 


5' 


Resistant 


aene : 



ereB 

-TCT TTT TGT TAC GAC ATA CGC TTT T 
-AGT GCT TCT TTA TCC GCT GTT CTA 

-CAG CGG ATA AAG AAG CAC TAC ACA TT 
-CCT CCT GAA ATA AAG CCC GAC AT 



Originating DNA fragment 



SEQ ID 
NO. 



1535 a 
1535 a 

1535 a 
1535 a 



Nucleotide 
position 



152-176 
456-479 

461-486 
727-749 



1340 
1341 b 

1936 
1937 b 

1942 
1937 b 

2040 
2041 b 



Resistance aene : 



5' 


-GAA 


CAA 


GGT 


ATG 


ACA 


CCG 


GAT 


AAA 


T 


1299 a 


163-188 


5' 


-GAT 


AAC 


TGA 


AAT 


CCT 


GAG 


CCA 


TAC 


G 


1299 a 


274-299 


5' 


-TAC 


CAC 


CCG 


CAC 


GGC 










1954 a 


205-219 


5' 


-CGG 


AGT 


CGC 


CGT 


CGA 


TG 








1954 a 


309-325 


5' 


-GAC 


TGG 


AAC 


AAA 


GCC 


TAT 


AAA 


AAA 


TCA 


1954 a 


148-174 


5' 


-CGG 


AGT 


CGC 


CGT 


CGA 


TG 








1954 a 


309-325 


5' 


-TGT 


GAC 


CCC 


AGA 


CAA 


ACC 


C 






2054 a 


33-51 


5' 


-GTT 


GAG 


CGG 


CAG 


CAC 


TAT 


CT 






2054 a 


207-226 



InhA 



2098 
2D99 b 



5' 
5' 



Resistance crenes: 



-CTG AGT CAC ACC GAC AAA CGT C 
-CCA GGA CTG AAC GGG ATA CGA A 

linA, 2inA' 



2101* 
2101 a 



910-931 
1074-1095 



1537 b 

1538 
1539 b 



d -aua 'ivr ait aaC u-uS~iwT aca"aua"a 
5 ' -CTT TGT AAT TAG TTT CTG AAA ACC A 

5 ' -TTA GAA GAT ATA GGA TAC AAA ATA GAA G 
5' -GAA TGA AAA AGA AGT TGA GCT T 



Resist ance q^n^ - 



HnB 



XD4U— 

1540 a 

1540 a 
1540 a 



352-376 

187-214 
404-425 



1541 
1542 b 

1543 
1544 b 



5' -TGA TAA TCT TAT ACG TGG GGA ATT T 1545* a 

5' -ATA ATT TTC TAA TTG CCC TGT TTC AT 1545 a 

5' -GGG CAA TTA GAA AAT TAT TTA TCA GA 2545a 

5 ' -TTT TAC TCA TGT TTA GCC AAT TAT CA 1545 a 



246-270 
359-384 

367-392 
579-604 



a Sequence from databases. 

These seqaencBS are from the complementary DNA strand 
orxginatxng fragment given. in the Sequence Listing. 



of the sequence of the 
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Annex L: Specxfxc and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents resistance genes 
sequences) (continued). 

Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Kesn. stance 
1546 


qene: 

5' -CAA 


1547 b 


5'-TAA 


Resistance 


qene : 


1549 


5 ' -GCT 


1550 b 


5 ' -TAG 


Resistance 


aenes : 



mefA 



mefE 



1552 
1553 b 

1554 
1555 b 



me£A, mefE 

5'-GGC AAG CAG TAT CAT TAA TCA CTA 
5' -CAA TGC TAC GGA TAA ACA ATA CTA TC 

5 ' -AGA AAA TTA AGC CTG AAT ATT TAG GAC 
5 '-TAG TAA AAA CCA ATG ATT TAC ACC G 



Resist ance genes : 



mphA, mphK 



1556 
1557 b 



5 ' -ACT GTA CGC ACT TGC AGC CCG ACA T 
5 '-GAA CGG CAG GCG ATT CTT GAG CAT 



5' -GTG GTG GTG CAT GGC GAT CTC T 
5'-GCC GCA GCG AGG TAC TCT TCG TTA 

Resistance g ene: 



1558 
1559 b 



mupA 

2142^ 5'-GCC TTA ATT TCG GAT AGT GC 

2143° 5 ' -GAG AAA GAG CCC AAT TAT CTA ATG T 

Resistance gene : parC 

1342 
1343 b 

1934 
1935 b 

2044 
2045 b 



1548 a 
1548 a 



1551 a 
1551 a 



1548 a 
1548 a 

1548 a 
1548 a 



1560 a 
1560 a 

1560 a 
1560 a 



2144 a 
2144* ~ 



625-646 
816-842 



815-840 
1052-1075 



50-73 
318-343 

1010-1035 
1119-1143 



33-57 
214-237 

583-604 
855-878 



1831-1850 
2002-2026 



5' 


-GAT GTT 


ATT 


GGT 


CAA 


TAT 


CAT 


CCA 




1321 a 


205-229 


5' 


-AAG AAA 


CTG 


TCT 


CTT 


TAT 


TAA 


TAT 


CAC GT 


1321 a 


396-425 


5' 


-GAA CGC 


CAG 


CGC 


GAA 


ATT 


CAA 


AAA 


G 


1781 


67-91 


5' 


-AGC TCG 


GCA 


TAC 


TTC 


GAC 


AGG 






1781 


277-297 


5' 


-ACC GTA 


AGT 


CGG 


CCA 


AGT 


CA 






2055 a 


176-195 


5' 


-GTT CTT 


TCT 


CCG 


TAT 


CGT 


C 






2055 a 


436-454 



a Sequence from databases . 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequenc Listing. 
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Aim ^^ : Sp ci f ic . ^i^itoue primers for nucleic acid 

amplification (antimicrobial agents r sistance genes seouences) 
(continued) . 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



Resistance 


qene: 




ppflo-like 












2163 
2164 b 


5 ' -ACC 
5' -CAA 


TTC 
CGA 


ATC CTA CCG ATG 
CAC CAG CAC TGC 


TGG 
CAT 


GTT 
TG 




2165 a 
2165 a 


922-945 
1136-1158 


Resistance 


..gene : 




xpoB 












2065 
2066 b 


5 ' —CCA 
5 ' -CAC 


CGA 


CtsX CafcxA GGC GAT 
CAG CGA GCC GAT 


CAC 
CAG 


A 
A 




2072 a 
2072 a 


1218-1239 
1485-1506 


Resistance 


crene : 




satG 












1581 
1582 b 


5'-AAT 
5 ' -GGC 


TGG 
AAA 


GGA CTA CAC CTA 
TCA GTC AGT TCA 


TTA 
GGA 


TGA 
GT 


TG 


1585 a 
1585 a 


93-118 
310-332 


1583 
1584 b 


5' -CGA 
5 ' -TCA 


TTG 
CCT 


GCA ACA ATA CAC 
ATT TTT ACG CCT 


TCC 
GGT 


TG 
AGG 


AC 


1585 a 
1585 a 


294-316 
388-413 


Resistance 


crene: 




sulZX 












1961 
1962 b 


5 ' -GCT 
5' -GGA 


CAA 
CAA 


GGC AGA TGG CAT 
GGC GGT TGC GTT 


TCC 
TGA 


C 
T 




1965 a 
1965 a 


222-243 
496-517 


1963 
1964 b 


5 ' -CAT 
5' -ATC 


TCC 
TGC 


CGT CTC GCT CGA 
CTG CCC GTC TTG 


CAG 
C 


T 




1965 a 
1965 a 


237-258 
393-411 


Resistance 


qene: 




tetB 












1966 
1967 b 


5 ' -CAT 
5 ' -CAG 


GCC 
CAA 


AGT CTT GCC AAC 
TAA GTA ATC CAG 


G 

CGA 


TG 




1970 a 
1970 a 


66-84 
242-264 



1968 
1969 b 



5 * -GGA GAG ATT TCA CCG CAT AG 
5'-AGC CAA CCA TCA TGC TAT TCC A 



1970 a 
1970 a 



457-476 
721-742 



Resistance crene 



tetM 



1586 
1587 b 

1588 
1589 b 



5 ' -ATT CCC ACA ATC TTT TTT ATC AAT AA 1590 a 

5 '-CAT TGT TCA GAT TCG GTA AAG TTC 1590 a 

5 ' -GTT TTT GAA GTT AAA TAG TOT TCT T 1590 a 

5' -CTT CCA TTT GTA CTT TCC CTA 1590 a 



361-386 
501-524 

957-981 
1172-1192 



a Sequence from databases. 

These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 



337 



WO 01/23604 



PCT/CA00/01150 



Annex L: Specific and ubiquitous primers for nucleic acid 
amplification (antimicrobial agents resistance aenes 
sequences) (continu d) . y 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Resistance g gm^: 



vatB 



1609 
1610 b 

1611 
1612 b 



5' -GCC CTG ATC CAA ATA GCA TAT A 
5 ' -CCT GGC ATA ACA GTA ACA TTC TG 

5'-TGG GAA AAA GCA ACT CCA TCT C 
5 ' -ACA ACT GAA TTC GCA GCA ACA AT 



Resistance q^n^ : 



vatC 



1614 
1615 b 

1616 
1617 b 



5 '-CCA ATC CAG AAG AAA TAT ACC C 
5 ' -ATT AGT TTA TCC CCA ATC AAT TCA 

5 '-ATA ATG AAT GGG GCT AAT CAT CGT AT 
5 ' -GCC AAC AAC TGA ATA AGG ATC AAC 



Resist ance cr^n<=>; 



1619 


5' 


-AAG 


1620 b 


5' 


-TGT 


1621 


5' 


-AAT 


1622 b 


5' 


-CGA 


Resistance. 


aene: 





vga 



1624 
1625 b 



vgaB 

5' -TTC TTT AAT GCT CGT AGA TGA ACC TA 
5' -TTT TCG TAT TCT TCT TGT TGC TTT C 



1626 
1627 b 



5' -AGG AAT GAT TAA GCC CCC TTC AAA AA 
5 ' -TTA CAT TGC GAC CAT GAA ATT GCT CT 
Resist ance gem <=>.<=; : vgrb, vgh 



1629 
1630 b 

1631 
1632 b 



5' -AAG GGG AAA GTT TGG ATT ACA CAA CA 
5' -GAA CCA CAG GGC ATT ATC AGA ACC 

5 ' -CGA CGA TGC TTT ATG GTT TGT 

5' -GTT AAT TTG CCT ATC TTG TCA CAC TC 



1613 a 
1613 a 

1613 a 
1613 a 



16l8 a 
1618 a 

1618 a 
1618 a 



1623 a 
1623 a 

1623 a 
1623 a 



1628 a 
1628 a 

1628 a 
1628 a 



1633 a 
1633 a 

1633 a 
1633 a 



Nucleotide 
position 



11-32 
379-401 

301-322 
424-446 



26-47 
177-200 

241-266 
463-486 



641-664 
821-843 

843-868 
975-1000 



354-379 
578-602 

663-688 
849-874 



73-98 
445-468 

576-596 
850-875 



a Sequence from databases, 
b 



es arS f *° m the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. ^ 
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LS SSSJL^-™* / ubi ^ to ^ s . PrimerB for nucleic acid 
amplification (antimicrobial agents resistance qenes 
sequences) (continued). genes 



SEQ ID NO. 



Originating DNA fragment 



Nucleotide sequence 



Resistance g pn^- 



vgbB 



1634 
1635 b 

1636 
1637 b 



5' -TTA ACT TGT CTA TTC CCG ATT CAG G 
5'-GCT GTG GCA ATG GAT ATT CTG TA 

5 ' -TTC CTA CCC CTG ATG CTA AAG TGA 
5' -CAA AGT GCG TTA TCC GAA CCT AA 



Resistance q^r><=> 



Sequencing primers 
gyrA 



Resistance 

1302 b 

1303 

1304 b 

1305 
1306 b 



g.^ne : QyrB 

^Ir^L^J...^ 1 MQT CCT H.qT ATf: TA 
5'-TAI ADI GGI GGI KKI GCI ATR TA 

5 ' -GGI GAI GAI DYI MGI GAR GG 

5 ' -CIA RYT TIK YIT TIG TYT G 

5' -ATG GTG ACT GCA TTG TCA GAT G 
5' -GTC TAC GGT TTT CTA CAA CGT C 



SEQ ID 
NO. 



1882 a 
1882 a 

1882 a 
1882 a 



13 07 a 

1307 a 

1307 a 

13 07 a 
1307 a 



Nucleotide 
position 



23-47 
267-289 

155-178 
442-464 



1290 
1292 b 


5' 
5' 


-GAY 
-ARI 


TAY 
SCY 


GCI 
TCI 


ATG 
ARI 


ISI 
ATR 


GTI 
TGI 


ATH 
GC 


GT 


1299 a 
1299 a 


70-83 
1132-1152 


1291 
1292 b 


5' 
5' 


-GCI 
-ARI 


YTI 
SCY 


CCI 
TCI 


GAY 
ARI 


GTI 
ATR 


MGI 
TGI 


GAY 
GC 


GG 


1299 a 
1299 a 


100-123 
1132-1152 


1293 
1294 b 


5' 
5' 


-ATG 
-ATG 


GCT 
ATT 


GAA 
GTT 


TTA 
GTA 


CCT 
TAT 


CAA 
CTT 


TC 
CTT 


CAA C 


1299 a 
1299 a 


1-21 
2626-2651 


1295 b 


5' 


-CAG 


AAA 


GTT 


TGA 


AGC 


GTT 


GT 




1299 a 


1255-1275 


1296 


5' 


-AAC 


GAT 


TCG 


TGA 


GTC 


AGA 


TA 




1299 a 


1188-1208 


1297 
1298 b 


5' 
5' 


-CGG TCA 
-ACG AAA 


ACA 
TCG 


TTG 
ACC 


AGG 
GTC 


AAG AGC 
TCT TTT 


T 
TC 


13 00 a 
1300 a 


29-51 
415-437 



1600-1623 

955-975 

1024-1043 

1-23 
1858-1888 



a Sequence from databases. 

b SSLiSEffJSL?**.. fr -° m tt ? e com PlementatY DMA strand 
originating fragment given in the Sequence Listing. 



of the sequence of the 
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Annex L: Specific and ubiquitous 

amplification C«aS2SeSSlal ^EL *° T ™<=leic acid 
secjuences) (continued) agents resistance genes 



SEQ ID NO. 



Nucleotide 



sequence 



Originating DNA fragment 



Sequencing primers (continued) 



pare 



1308 
1309 b 

1310 

1311 b 

1312 
1313 b 

1314 

1315 b 

1316 
1317 b 

1318 
1319 b 



5' -ATG TAY GTI ATI ATG GAY MGI GC 
-ATI ATY TTR TTI CCY TTI CCY TT 

5 ' -ATI ATI TSI ATI ACY TCR TC 

5'^GAR ATG AAR ATI MGI GGI GAR CA 

5'-AAR TAY ATI ATI CAR GAR MGI GC 
5 '-AMI AYI CKR TGI GGI TTI TTY TT 

5-TAI GAI TTY ACI GAI SMI CAR GC 

5' -ACI ATI GCI TCI GCY TGI KSY TC 

5 '-GTG AGT GAA ATA ATT CAA GAT T 
5'-CAC CAA AAT CAT CTG TAT CTA C 

5 -ACC TAY TCS ATG TAC GTR ATC ATG GA 
5 -AGR TCG TCI ACC ATC GGY AGY TT 



parE 



1322 
1325 b 



1323 



1326 
1327 b 



5-RTI GAI AAY ISI GTI GAY GAR G 
5'-RTT CAT YTC ICC IAR ICC YTT 

5 ' -ACI AWR SAI GGI GGI ACI CAY G 

5'-TGA TTC AAT ACA GGT TTT AGA G 
5'-CTA GAT TTC CTC CTC ATC AAA T 



SEQ id 
NO. 



1320* 
1320 a 

1320 a 

1320 a 

132l a 
132l a 

132l a 

1321 a 

132l a 
132l a 

1320 a 
1320 a 



1328 a 
1328 a 

1328 a 

1328 a 
1328 a 



Nucleotide 
position 



67-90 
1993-2016 

1112-1132 

1288-1311 

67-90 
2212-2235 

1228-1251 

1240-1263 

1-23 
2356-2378 

58-84 
832-855 



133-155 
1732-1752 

829-850 



27-49 
1971-1993 



Sequence from databases. 



the sequence of the 
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Annex LI: internal hybridization probes for specific detection of 
antimicrobial ag nts resistance genes sequences 
(continued) • 

Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Resistance gene: blaTEM (continued) 



2007 


5 ' -TGG 


AGC 


CAG TGA 


GCG 


TGG 






2010 a 


699-716 


2008 


5 ' -TCT 


GGA 


GCC GAT 


GAG 


CGT 


G 




1929 a 


697-715 


2009 


5 ' -CTG 


GAG 


CCA GTA 


AGC 


GTG 


G 




2011 a 


698-716 


2141 


5 ' -CAC 


CAG 


TCA CAG 


AAA 


AGC 






1927 a 


311-328 


Resistance 


crene : 




dhfrla 












2253 


5 ' —CAT 


TAP 


CCA ACC 


GAA 


AGT 


TV 

A 




14ol cl 


158-176 


Resistance 


QRne: 




embB 














2104 


5 ' -CTG 


GGC 


ATG GCI 


CGA 


GTC 






2105 a 


910-927 


Resistance 


crene: 


















1333 


5 ' -TCA 


TGG 


TGA CTT 


ATC 


TAT 


TTA 


TG 


1299 a 


240-263 


1334 


5 ' -CAT 


CTA 


TTT ATA 


AAG 


CAA 


TGG 


TA 


1299 a 


251-274 


1335 


5 ' -CTA 


TTT 


ATG GAG 


CAA 


TGG 


T 




1299 a 


254-273 


1940 


5 ' -GTA 


TCG 


TTG GTG 


ACG 


TAA 


T 




1299 a 


206-224 


1943 


5 ' -GCT 


GGT 


GGA CGG 


CCA 


G 






1954 a 


279-294 


1945 


5 ' -CGG 


CGA 


CTA CGC 


GGT 


AT 






1954 a 


216-232 


1946 


5 ' -CGG 


CGA 


CTT CGC 


GGT 


AT 






1954 a 


216-232 


1947 


5 ' -CGG 


TAT 


ACG GCA 


CCA 


TCG 


T 




1954 a 


227-245 


1948 


5 ' -GCG 


GTA 


TAC AAC 


ACC 


ATC 


G 




1954 a 


226-244 


1949 


5 ' -CGG 


TAT 


ACG CCA 


CCA 


TCG 


T 




1954 a 


227-245 


2042 


5 ' -CAC 


GGG 


GAT TTC 


TCT 


ATT 


TA 




2054 a 


103-122 


2043 


5 ' -CAC 


GGG 


GAT TAC 


TCT 


ATT 


TA 




2054 a 


103-122 



Resistance oene : i nhfl 



2100 5' -GCG AGA CGA TAG GTT GTC 2101 a 1017-1034 

Resistance gene : parC 



1336 


5 * -TGG 


AGA 


CTA 


CTC 


AGT 


GT 


1321 a 


232-249 


1337 


5' -TGG 


AGA 


CTT 


CTC 


AGT 


GT 


1321 a 


232-249 


1338 


5' -GTG 


TAC 


GGA 


GCA 


ATG 




1321 a 


245-260 


1339 


5 ' -CCA 


GCG 


GAA 


ATG 


CGT 




1321 a 


342-357 


1941 


5 ' -GCA 


ATG 


GTC 


CGT 


TTA 


AGT 


1321 a 


253-270 


1944 


5 ' -TTT 


CGC 


CGC 


CAT 


GCG 


TTA C 


1781 


247-265 


1950 


5' -GGC 


GAC 


ATC 


GCC 


TGC 




1781 


-137-151 


1951 


5' -GGC 


GAC 


AGA 


GCC 


TGC 


TA 


1781 


137-153 



a Sequence from databases. 
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Annex Li 




1886 

1887 

1888 

1889 

1890 

1891 

1892 

1893 

1894 

1895 

1896 

1897 

1898 



5' -GAC GCC 
5' -GAC GCC 
5' -GAC GCC 
5'-GTT CGC 
5' -TTC GCA 
5'-CCG GAG 
5' -CGG AGC 
5'-GGA GCT 
5 ' -GAC CGG 
5' -CGG AGC 
5' -GAA ACG 
5 '-CAT TAC 
5 ' -CCA TTA 



Resist ance q <smg>« 



CGC GCC ACC 
CGC GAC ACC 
CGC AAC ACC 
AAC TGC AGC 
ACG GCA GCT 
CTG CCG AIC 
TGC CAA RCG 
GGC GAR CGG 
AGC TAG CGA 
TAG CAA RCG 
GAA CTG AAT 
CAT GGG CGA 
CCA TGA GCG 

hlaTEM 



ACT 

ACT A 

ACT A 

TGC TG 

GCT G 

GGG 

GGG 

GGT 
RCG 
GGG T 
GAG GCG 
TAA CAG 
ATA ACAG 



1909 
1910 
1911 
1912 
1913 
1914 
1915 

1916 

1917 

1918 

1919 

1920 

1921 

1922 
1923 
1924 
1925 
1926 
2006 



5' -ATG ACT TGG TTA AGT ACT CAC C 
5' -ATG ACT TGG TTG AGT ACT CAC C 
5' -CCA TAA CCA TGG GTG ATA ACA C 
5 ' -CCA TAA CCA TGA GTG ATA ACA C 
5 ' -CGC CTT GAT CAT TGG GAA CC 
5 ' -CGC CTT GAT CGT TGG GAA CC 
5' -CGC CTT GAT AGT TGG GAA CC 
5' -CGT GGG TCT TGC GGT ATC AT 
5' -CGT GGG TCT GGC GGT ATC AT 
5 ' -GTG GGT CTC ACG GTA TCA TTG 
5 ' -CGT GGG TCT CTC GGT ATC ATT 
5 ' -CGT GGI TCT CGC GGT ATC AT 
5 ' -CGT GGG TCT AGC GGT ATC ATT 
5'-GTT TTC CAA TGA TTA GCA CTT TTA 
5'-GTT TTC CAA TGA TAA GCA CTT TTA 
5'-GTT TTC CAA TGC TGA GCA CTT TT 
5 ' -CGT TTT CCA ATG ATG AGC ACT TT 
5'-GTT TTC CAA TGG TGA GCA CTT TT 
5 ' -TGG AGC CGG TGA GCG TGG 



1406 a 



1900 a 

1899 a 

1901 a 

1899 a 

1899 a 

1902 a 

1903 a 

1899 a 

1904 a 

1905 a 

1899 a 

1899 a 

1899 a 



1928 a 
1927 a 
1928 a 
1927 a 
1928 a 
1927 a 
1929 a 
1927* 
1930 a 
1927 a 
1931 a 
1927 a 
1932 a 
1927 a 
1927 a 
1932 a 
1927 a 
1933 a 
1927 a 



149-166 



484-501 
514-532 
514-532 

593- 612 

594- 612 

692- 709 

693- 710 

694- 711 
690-707 
693-711 
484-504 
366-386 
365-386 



293-314 
293-314 
371-392 
371-392 
475-494 
475-494 
475-494 
712-731 

712- 731 

713- 733 
712-732 

712- 731 

713- 733 
188-211 
188-211 
188-210 

187- 209 

188- 210 
699-716 
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Internal hybridization probes for specific detection of 
antimicrobial agents resistance genes sequences 
(continued) . 



SEQ ID NO. 


Nucleotide 


sequence 

- 


Originating DNA fragment 

SEQ ID Nucleotide 
NO. position 


Resistance 


gene: 




pare (continued) 






1952 


5 ' -CCT 


GCT 


ATG GAG CGA TGG T 


1781 


147-165 




5 ' -CGC 


CTG 


CTA TAA AGC GAT GGT 


1781 


145-165 


2046 


5 ' -ACG 


GGG 


ATT-TTT CTA TCT AT 


2055 a 


227-246 


Resistance 


crene : 




rpoB 






2067 


5 ' -AGC 


TGA 


GCC AAT TCA TGG 


2072 a 


1304-1321 


2068 


5 ' -ATT 


CAT 


GGA CCA GAA CAA C 


2072 a 


1314-1332 


2069 


5 ' •CGC 


TGT 


CGG GGT TGA CCC 


2072 a 


1334-1351 


2070 


5'-GTT 


GAC 


CCA CAA GCG CCG 


2072 a 


1344-1361 


2071 


5 ' -CGA 


CTG 


TCG GCG CTG GGG 


2072 a 


1360-1377 


Resistance., 


aene: 




tetM 






2254 


5 ' -ACC 


TGA 


ACA GAG AGA AAT G 


1590 a 


1062-1080 



Sequence from databases. 
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Annex LII: Molecular beacon internal hybridization probes for 
specific detection of atpD sequences. 



SEQ ID NO. Nucleotide sequence 3 



Originating DNA fragment 

SEQ XD Nucleotide 
NO. position 



Bacterial species: 
2136 



Bacteroides fragilis 



5 9 -CCA ACS CGT CCT CAA TCA TTT CTA ACT TCT 929 
ATG GCC GGC GTT SQ 



Bacter ial sp er-ips? 
2182 



Bordetella pertussis 



5' -QQG £££ CAA CGA CTT CTA CCA CGA AAT GGA 1672 
AGA GTC£CS CGC 



Bacterial croup ; 
2133 



Caj^pylobacter jejuni and coli 



5 ' "CCA CGC ACA WAA ACT TGT TTT AGA AGT 1576, 
AGC AGC WCA CCG TQQ 1600,1849, 

1863,2139 b ' c 



353-382 



576-605 



44-73 d 



Funaal species 
2078 



Candida, glabrata 



5' -CCS MsC CTT GGT CTT CGG CCA AAT GAA CGC 463 
TCG Q 



Funaal spec i eg 
2075 



Candida krusei 



5 ' -CCG AgC CAG GTT CTG AAG TCT CTG CAT TAT 468 
TAG GTS CTQ SG 



Fungal species: 
2080 



Candida lusitaniae 



5 ' -CCG AGC CGA AGA GGG CCA AGA TGT C£C 2CS 470 
G 



Funaal species 
2079 



Candida parapsilosis 



5' -£CG &GC GTT CAG TTA CTT CAG TCC AAG CCG 472 
GCT CGG 



442-463 



720-748 



520-538 



837-860 



Funaal species 
2077 



Candida tropicalis 



5' -CCS AGC AAC CGA TCC AGC TCC AGC TAC GCT 475 

CGg 

Bacterial specie- Klebsiella pneumoniae 

2281 5'-CCC CCA GCT GGG CGG CGG TAT CGA TCC CSS 317 

* Underlined nucleotides indicate the molecular beacon's stem. 

Sequence from databases. 
*j These sequences were aligned to derive the corresponding primer. 

2? 1576? Dt:i<5e positions refer to c - Jejuni atpD sequence fragment (SEQ ID 



877-897 



40-59 
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Annex LilXz Molecular beacon internal hybridization probes for 
specific detection of atpJD s quences (continued) • 

Originating DMA fragment 



SEQ ID NO. Nucleotide seguence a S E q xd 



Nucleotide 
NO. position 



Fungal genus: Candida sp. 

2076 5'-CC£ AGC YGA YAA CAT TTT CAG ATT CAC CCA 460-478, 6Q7-723 C 

RGC GQZ £G£ 663 b 

<* Underlined nucleotides indicate the molecular beacon's stem. 

These sequences were aligned to derive the corresponding primer. 
° ™\,S UC ifJ?f ide P° sitions ref er to the C. albicans atpD sequence fragment (SEQ 

iU NO. 4o0) . 
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Annex LI 1 1: Internal hybridization prob s for specific detection of 
atpD sequences. 

Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Bacterial species: AcinetoJbacter baumannii 

2169 5'-CCC GTT TGC GAA AGG TGG 

Bacterial s-becies: Klebsiella pneumoniae 

2167 5'-CAG CAG CTG GGC GGC GGT 



243 304-321 



317 36-53 
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Annex LIV: Internal hybridization probes for specific d tection of 
ddl and mtl sequences . 

~~ Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Bacterial species: Knterococcus faecium (ddl) 

2286 5'-AGT TGC TGT ATT AGG AAA TG 2288 a 784-803 

2287 5'-TCG AAG TTG CTG TAT TAG GA 2288 a 780-799 

Bacterial species: Enterococcus faecalis mtl) 

2289 5'-CAC CGA AGA AGA TGA AAA AA 

2290 5'-TGG CAC CGA AGA AGA TGA 

2291 5 ' -ATT TTG GCA CCG AAG AAG A 



Sequence from databases. 



1243 a 264-283 
1243 a 261-278 
1243 a 257-275 
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What is claimed is: 

1 . A method for generating a repertory of nucleic acids of tuf, fits, atpD 
and/or recA genes from which are derived probes or primers, or both, useful for 
the detection of one, more than one related microorganisms, or substantially all 
microorganisms of a group selected from algae, archaea, bacteria, fungi and 
parasites, which comprises the step of: 

amplifying the nucleic acids of a plurality of determinedalgal, 
archaeal, bacterial, fungal and parasitical species with any combination of 
the primer pairs defined in SEQ ID NOs.: 543, 556-574, 636-655, 664, 
681-683, 694, 696-697, 699-700, 708, 812-815, 911-917, 919-922, 935- 
938, 1203-1207, 1212-1213, 1221-1229, 1605-1606, 1974-1984, 1999- 
2003, 2282-2285. 



2. A method for generating a repertory of nucleic acid sequences, which 
comprises the steps of: 

reproducing the method of claim 1, and 
adding the step of: 

sequencing said nucleic acids. 

3. A method for generating sequences of probes, or primers, or both, 
useful for the detection of one, more than one related microorganisms, or 
substantially all microorganisms of a group selected from algae,archaea, bacteria, 
fungi and parasites, which comprises the steps of: 

. reproducing the method of claim 2, and 
adding the steps of: 

aligning a subset of nucleic acid sequences of said 

repertory, 

locating nucleic acid stretches that are present in the 
nucleic acids of strains or representatives of said one, more than one 
related microorganisms, or substantially all microorganisms of said 
group, and not present in the nucleic acid sequences of other 
microorganisms, and 
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deriving consensus nucleic acid sequences useful as 
probes or primers from said stretches. 

4. A bank of nucleic acids comprising the repertory of nucleic acids 
obtained from the method of claim 1 . 

5. A bank of nucleic acid sequences comprising the repertory of nucleic 
acid sequences obtained from the method of claim 2. 

6. A method for generating sequences of probes, or primers, or both, 
useful for the detection of one, more than one related microorganisms, or 
substantially all microorganisms of a group selected from algae,archaea, bacteria, 
fungi and parasites, which comprises the steps of. 

aligning a subset of nucleic acid sequences of the bank as 
defined in claim 5, 

locating nucleic acid sequence stretches that are present in the 
nucleic acid sequences of strains or representatives of said one, more than 
one related microorganisms, or substantially all microorganisms of said 
group, and not present in the nucleic acid sequences of other 
microorganisms, and 

deriving consensus nucleic acid sequences useful as probes or 
primers from said stretches. 

7. A method for generating probes, or primers or both, useful for the 
detection of one, more than one related microorganisms, or substantially all 
microorganisms of a group selected from algae, archaea, bacteria, fungi and 
parasites, which comprises the steps of: 

reproducing the method of claim 3 or 6, and 
adding the step of: 

synthesising said probes or primers upon the nucleic acid 
sequences thereof. 

8. A nucleic acid used for universal detection of any one of alga, 
archaeon, bacterium, fungus and parasite which is obtained from the method of 
claim 7. 
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9. A nucleic acid used for universal detection as set forth in claim 8, 
which has a nucleic acid sequence of at least 12 nucleotides capable of 
hybridizing with said any one of alga, archaeon, bacterium, fungus and parasite 
and with any one of SEQ ID NOs.: 543, 556-574, 636-655, 658-661, 664, 681- 
683, 694, 696, 697, 699, 700, 708, 812-815, 911-917, 919-922, 935-938, 1203- 
1207, 1212-1213, 1221-1229, 1605-1606, 1974-1984, 1999-2003,2282-2285. 

10. A nucleic acid used for the specific and ubiquitous detection and for 
identification of any one of a algal, archaeal, bacterial, fungal and parasitital 
species, genus, family and group, which is obtained from the method of claim 7. 

11. A nucleic acid as set forth in claim 10 having any one of the 
nucleotide sequences which are denned in SEQ ID NOs.: 

539, 540 for the detection and/or identification of Mycobacteriaceae 

family 

541, 542, 544, for the detection and/or identification of Pseudomonads 

2121 group 

545, 546 for the detection and/or identification of Corynebacterium 

sp. 

547, 548, 1202 for the detection and/or identification of Streptococcus sp. 
549, 550, 582, 583, for the detection and/or identification of Streptococcus 
625, 626, 627, 628, agalactiae 
1199 

551,552,2166, for the detection and/or identification of Neisseria 
2173, 217 '4, 2175, gonorrhoeae 
2116,2171, HIS, 
2179 

553, 575, 605, 606, for the detection and/or identification of StapJrylococcus sp. 
707, 1175,1176 

554,555,2213 for the detection and/or identification of Chlamydia 

trachomatis 
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576, 63 1 , 632, 633, for the detection and/or identification of Candida sp. 
634,635,1163, 

1164, 1167, 2076, 
2108,2109 

577, 1 1 5 6, 1 1 60 for the detection and/or identification of Candida albicans 
2073 

578, 1 1 66, 1 1 68, for the detection and/or identification of Candida dubliniensis 
2074 

579, 2 1 68 for the detection and/or identification of Escherichia coli 
580,603, 1174, for the detection and/or identification of Enterococcus 
1236, 1238, 2289, faecalis 

2290, 2291 

581 for the detection and/or identification of Haemophilus 

influenzae 

584,585,586,587, for the detection and/or identification of Staphylococcus 

588, 1232, 1234, aureus 

2186 

589,590,591,592, for the detection and/or identification of Staphylococcus 
593 epidermidis 

594, 595 for the detection and/or identification of Staphylococcus 

haemolyticus 

596, 597, 598 for the detection and/or identification of Staphylococcus 

hominis 

599,600,601,695, for the detection and/or identification of Staplrylococcus 
1208, 1209 saprophytics 

602, 1235, 1237, for the detection and/or identification of Enterococcus 
1696, 1697, 1698, faecium 
1699, 1700, 1701, 
2286, 2287 

604 for the detection and/or identification of Enterococcus 

gallinarum 

620, 1 122 for the detection and/or identification of Enterococcus 

casseliflavus, E.flavescens and E. gallinarum 
629,630,2085, for the detection and/or identification of Chlamydia 
2086, 2087, 2088, pneumoniae 
2089, 2090, 2091, 
2092 
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Sa' 639 ' *P detection and/or identification of at least the 

640,641,642 following: 

Abiotrophia adiacens, Abiotrophia defectiva, Acinetobacter 
baumannii, Acinetobacter Iwqffi, Aerococcus viridans 
Bacillus anthracis, Bacillus cereus, Bacillus subtilis, 
Brucella abortus, Burkholderia cepacia, Citrobacter 
mversus, Citrobacter freundii, Enterobacter aerogenes, 
Enterobacter agglomerans, Enterobacter cloacae, 
Enterococcus avium, Enterococcus casseliflavus, 
Enterococcus dispar, Enterococcus durans, Enterococcus 
faecalis, Enterococcus faecium, Enterococcus flavescens, 
Enterococcus gallinarum, Enterococcus mundtii, 
Enterococcus raffinosus, Enterococcus solitarius, 
Escherichia coli, Gemella morbillorum, Haemophilus 
ducreyi, Haemophilus haemolyticus, Haemophilus 
influenzae, Haemophilus parahaemolyticus, Haemophilus 
parainfluenzae, Hafnia atvei, Kingella Mngae, Klebsiella 
owtoca, Klebsiella pneumoniae, Legionella pneumophila, 
Megamonas hypermegale, Moraxelta atlantae, Moraxella 
catarrhalis, Morganella morganii, Neisseria gonorrheae, 
Neisseria meningitidis, Pasteurella aerogenes, Pasteurella 
multocida, Peptosti-eptococcus magnus, Proteus mirabilis, 
Providencia alcalifaciens, Providenciarettgeri, Providencia 
rustigianii, Providencia stuartii, Pseudomonas aeruginosa, 
Pseudomonas fluorescens, Pseudomonas stutzeri, 
Salmonella bongori, Salmonella choleraesuis, Salmonella 
enteritidis, Salmonella gallinarum, Salmonella 
typhimurium, Serratia liquefaciens, Serratia marcescens, 
Shigella Jlexneri, Shigella sonnei, Staphylococcus aureus, 
Staphylococcus capitis Staphylococcus epidermidis, 
Staphylococcus haemolyticus, Staphylococcus hominis, 
Staphylococcus lugdunensis, Staptynococcus sapropJryticus, 
Staphylococcus simulans, Stapltylococcus wameri, 
Stenotrophomonas maltophitia, Streptococcus 

acidominimus, Streptococcus agalactiae, Streptococcus 
anginosus, Streptococcus bovis, Streptococcus constellatus, 
Streptococcus cricetus, Streptococcus cristatus, 
Streptococcus dysgalactiae, Streptococcus equi, 
Streptococcus ferus, Sti'eptococcus gordonii, Streptococcus 
intermedius, Streptococcus macacae, Streptococcus mitis, 
Streptococcus /nutans, Streptococcus oralis, Streptococcus 
parasanguinis, Streptococcus parauberis, Streptococcus 
pneumoniae, Streptococcus pyogenes, Streptococcus ratti, 
Streptococcus salivarius, Streptococcus sanguinis, 
Streptococcus sobrinus, Streptococcus uberis, Streptococcus 
vestibularis, Vibrio cholerae, Yersinia enterocolitica, 
Yersinia pestis, Yersinia pseudotuberculosis. 

656, 657, 27 1 , for the detection and/or identification of Enterococcus sp. 

1136,1137 

70 1 , 702 for the detection and/or identification of Leishmania sp. 
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703, 704, 705, 706, 
793 

794, 795 

796, 797, 808, 809, 
810, 811 

798, 799, 800, 801, 
802, 803, 804, 805, 
806, 807 

816, 817, 818, 819 
820, 821, 822 

823, 824 
825, 826 

923, 924, 925, 926, 
927, 928 
933, 934 

994, 995, 996, 997, 
998, 999, 1000, 
1001, 1200, 1210, 
1211 

1157, 2079,2118 

1158, 1159, 2078, 
2110,2111 
1160, 2077,2119, 
2120 

1161,2075,2112, 

2113,2114 

1162 

1162, 2080,2115 

2116,2117 

1165 

1201 



for the detection and/or identification of Entamoeba sp. 

for the detection and/or identification of Trypanosoma cruzi 
for the detection and/or identification of Clostridium sp. 

for the detection and/or identification of Cryptosporidium 
parvum 

for the detection and/or identification of Giardia sp. 

for the detection and/or identification of Trypanosoma 

brucei 

for the detection and/or identification of Trypanosoma sp. 
for the detection and/or identification of Bordetella sp. 
for the detection and/or identification of Trypanosomatidae 
family 

for the detection and/or identification of Enterobacteriaceae 
group 

for the detection and/or identification of Streptococcus 
pyogenes 



for the detection and/or identification of Candida 
parapsilosis 

for the detection and/or identification of Candida glabrata 

for the detection and/or identification of Candida tropicalis 

for the detection and/or identification of Candida krusei 

for the detection and/or identification of Candida 
guilliermondii 

for the detection and/or identification of Candida lusitaniae 

for the detection and/or identification of Candida 
zeylanoides 

for the detection and/or identification of Streptococcus 
pneumoniae 
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1233 

1329, 1330, 1331, 
1332,2167, 2281 
1661, 1665 

1690, 1691, 1692, 
1693,2169 
1694, 1695,2122 

1971, 1972, 1973 
2081,2082, 2083 
2084 

2093, 2094, 2095, 
2096 

2106,2107 

2131,2132,2133 

2134,2135,2136 
2170 

2171 
2172 

2180,2181,2182 



for the detection and/or identification of Staphylococcus sp. 
other than S. aureus 

for the detection and/or identification of Klebsiella 
pneumoniae 

for the detection and/or identification of Escherichia coli 
and Shigella sp. 

for the detection and/or identification of Acinetobacter 
baumanii 

for the detection and/or identification of Pseudomonas 
aeruginosa 

for the detection and/or identification of Cryptococcus sp. 
for the detection and/or identification of Legionella sp. 
for the detection and/or identification of Legionella 
pneumophila 

for the detection and/or identification of Mycoplasma 
pneumoniae 

for the detection and/or identification of Cryptococcus 
neoformans 

for the detection and/or identification of Campylobacter 
jejuni and C. coli 

for the detection and/or identification of Bacteroides fragilis 
for the detection and/or identification of Abiotrophia 
adiacens 

for the detection and/or identification of Gemella sp. 

for the detection and/or identification of Enterococcus sp., 

Gemella sp., A. adiacens 

for the detection and/or identification of Bordetella 
pertussis. 



12. A method for detecting the presence in a test sample of a 
microorganism that is an alga, archaeum, bacterium, fungus or parasite, which 
comprises: 

a) putting in contact any test sample tuf or atpD or recA nucleic acids 
and nucleic acid primers and/or probes, said primers and/or probes having 
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been selected to be sufficiently complementary tohybridize to one or more 
tuf or atpD or recA nucleic acids that are specific to said group of 
microorganisms; 

b) allowing the primers and/or probes and any test sample fw/ or 
atpD or recA nucleic acids to hybridize under specified conditions such as 
said primers and/or probes hybridize to the tuf or atpD or recA nucleic 
acids of said microorganism and does not detectably hybridize to tuf or 
atpD or recA sequences from other microorganisms; and, 

c) testing for hybridization of said primers and/or probes to any 
test sample tuf ox atpD or recA nucleic acids. 

13. The method of claim 12 wherein c) is based on a nucleic acid target 
amplification method. 

14. The method of claim 12 wherein c) is based on a signal amplification 
method. 

15. The method of any one of claims 12 to 14 wherein said primers 
and/or probes that are sufficiently complementary are perfectly complementary. 

16. The method of any one of claims 12 to 14 wherein said primers 
and/or probes that are sufficiently complementary are not perfectly 
complementary. 

17. A method for the specific detection and/or identification of a 
microorganism that is an algal, archaeal, bacterial, fungal or parasitical species, 
genus, family or group in any sample, using a panel of probes or amplification 
primers or both, each individual probe or primer being derived from a nucleic 
acid which has a nucleotide sequence of at least 12 nucleotides in length capable 
of hybridizing with the nucleic acids of said microorganism and with a nucleic 
acid having any one of the nucleotide sequences defined in SEQ ID NOs.: 

539, 540 for the detection and/or identification of Mycobacteriaceae 

family 

54 1 , 542, 544, 2121 for the detection and/or identification ofPseudomonads group 
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545, 546 for the detection and/or identification of Corynebacterium 

sp. 

547, 548, 1202 for the detection and/or identification of Streptococcus sp. 
549, 550, 582, 583, for the detection and/or identification of Streptococcus 

625, 626, 627, 628, agalactiae 
1199 

551,552,2166, for the detection and/or identification of Neisseria 

2173,2174,2175, gonorrhoeae 

2176,2177,2178, 

2179 

553, 575, 605, 606, for the detection and/or identification of Staphylococcus sp. 
707, 1175, 1176 

554,555,2213 for the detection and/or identification of Chlamydia 

trachomatis 

576, 63 1 , 632, 633, for the detection and/or identification of Candida sp. 
634,635,1163, 

1164, 1167, 2076, 
2108,2109 

577, 1 1 56, 1 1 60 for the detection and/or identification of Candida albicans 
2073 

578,1166,1168, for the detection and/or identification of Candida 

2074 dubliniensis 

579, 2 1 68 for the detection and/or identification of Escherichia coli 

580,603,1174, for the detection and/or identification of Enterococcus 

1236, 1238, 2289, faecalis 
2290, 2291 

581 for the detection and/or identification of Haemophilus 

influeitzae 

584,585,586,587, for the detection and/or identification of Staphylococcus 

588,1232,1234, aureus 

2186 

589,590,591,592, for the detection and/or identification of Staphylococcus 

593 epidermidis 

594, 595 for the detection and/or identification of Staphylococcus 

haemolyticus 

596, 597, 598 for the detection and/or identification of Staphylococcus 

hominis 
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599,600,601,695, for the detection and/or identification of Staphylococcus 
1208, 1209 saprophyticus 

602, 1235, 1237, for the detection and/or identification of Enterococcus 
1696, 1697, 1698, faecium 
1699, 1700, 1701, 
2286, 2287 

604 for the detection and/or identification of Enterococcus 

gallinarum 

620,1122 for the detection and/or identification of Enterococcus 

casseliflavus, Eflavescens and E. gallinarum 
629,630,2085, for the detection and/or identification of Chlamydia 
2086, 2087, 2088, pneumoniae 
2089,2090,2091, 
2092 

636, 637, 638, 639, for the detection and/or identification of at least the following: 
640, 641, 642 Abiotrophia adiacens, Abiotrophia defectiva, Acinetobacter 

baumannii, Acinetobacter Iwoffi, Aerococcus viridans, 
Bacillus anthracis, Bacillus cereus, Bacillus subtilis, Brucella 
abortus, Burkholderia cepacia, Citrobacter diversus, 
Citrobacter freundii, Enterobacter aerogenes, Enterobacter 
agglomerans, Enterobacter cloacae, Enterococcus annum, 
Enterococcus casseliflavus, Enterococcus dispar, 
Enterococcus durans, Enterococcus faecalis, Enterococcus 
faecium, Enterococcus flavescens, Enterococcus gallinarum, 
Enterococcus mundtii, Enterococcus raffinosus, 

Enterococcus solitaries, Escherichia coli, Gemella 
morbillorum, Haemophilus ducreyi, Haemophilus 
haemolyticus, Haemophilus influenzae, Haemophilus 
par ahaemolyticus, Haemophilus par -ainfluenzae, Hafnia alvei, 
Kingella kingae, Klebsiella oxytoca, Klebsiella pneumoniae, 
Legionella pneumophila, Megamonas hypermegale, 
Moraxella atlantae, Moraxella catarrhalis, Morganella 
morganii, Neisseria gonorrheae, Neisseria meningitidis, 
Pasteurella aerogenes, Pasteurella multocida, 
Peptostreptococcus magnus, Proteus mirabilis, Providencia 
alcalifaciens, Providencia rettgeri, Providencia rustigianii, 
Providencia stuartii, Pseudomonas aeruginosa, 

Pseudomonas fluorescens, Pseudomonas stutzeri, 
Salmonella bongori, Salmonella choleraesuis, Salmonella 
enteritidis, Salmonella gallinarum, Salmonella typhimurium, 
Serratia liquefaciens, Serratia marcescens, Slrigella flexneri, 
Shigella sonnei, Staphylococcus aureus, Staphylococcus 
capitis Staphylococcus epidermidis, Staphylococcus 
haemolyticus, Staphylococcus hominis, Staphylococcus 
lugdunensis, Staplrylococcus saprophyticus, Staphylococcus 
simulans, Staphylococcus warneri, Stenotrophomonas 
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maltophilia, Sfreptococcus acidominimus, Streptococcus 
agalactiae, Sfreptococcus anginosus, Streptococcus bovis, 
Streptococcus constellatus, Streptococcus cricetus, 
Streptococcus cristatus, Streptococcus dysgalactiae, 
Streptococcus equi, Sfreptococcus ferus, Streptococcus 
gordonii, Streptococcus intermedius, Streptococcus 
macacae, Streptococcus mitis, Streptococcus mutans, 
Streptococcus oralis, Streptococcus parasanguinis, 
Streptococcus parauberis, Streptococcus pneumoniae, 
Streptococcus pyogenes, Streptococcus ratti, Streptococcus 
salivarius, Streptococcus sanguinis, Streptococcus sobrinus, 
Streptococcus uberis, Streptococcus vestibularis, Vibrio 
cholerae, Yersinia enterocolitica, Yersinia pestis, Yersinia 
pseudotuberculosis. 

656, 657, 271, for the detection and/or identification of Enterococcus sp. 
1136,1137 

701 , 702 for the detection and/or identification of Leishmania sp. 

703, 704, 705, 706, for the detection and/or identification of Entamoeba sp. 
793 

794, 795 for the detection and/or identification of Trypanosoma cruzi 

796, 797, 808, 809, for the detection and/or identification of Clostridium sp. 
810,811 

798, 799, 800, 801, for the detection and/or identification of Cryptosporidium 

802,803,804,805, parvum 
806, 807 

816, 817, 81 8, 8 19 for the detection and/or identification of Giardia sp. 

820, 821, 822 for the detection and/or identification of Trypanosoma 

brucei 

823, 824 for the detection and/or identification of Trypanosoma sp. 

825, 826 for the detection and/or identification of Bordetella sp. 

923, 924, 925, 926, for the detection and/or identification of Trypanosomatidae 

927, 928 family 

933, 934 for the detection and/or identification ofEnterobacteriaceae 

group 

994,995,996,997, for the detection and/or identification of Streptococcus 

998, 999, 1000, pyogenes 
1001, 1200, 1210, 
1211 

1157,2079,2118 for the detection and/or identification of Candida 

parapsilosis 
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1158, 1159, 2078, 
2110,2111 
1160, 2077,2119, 
2120 

1161,2075,2112, 

2113,2114 

1162 

1162, 2080,2115 

2116,2117 

1165 

1201 

1233 

1329, 1330, 1331, 
1332,2167, 2281 
1661, 1665 

1690, 1691, 1692, 
1693,2169 
1694, 1695, 2122 

1971, 1972, 1973 
2081, 2082, 2083 
2084 

2093, 2094, 2095, 
2096 

2106,2107 
2131,2132, 2133 
2134,2135,2136 
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for the detection and/or identification of Candida glabrata 

for the detection and/or identification of Candida tropicalis 

for the detection and/or identification of Candida krusei 

for the detection and/or identification of Candida 
guilliermondii 

for the detection and/or identification of Candida lusitaniae 

for the detection and/or identification of Candida 
zeylanoides 

for the detection and/or identification of Streptococcus 
pneumoniae 

for the detection and/or identification of Staphylococcus sp. 
other than S. aureus 

for the detection and/or identification of Klebsiella 
pneumoniae 

for the detection and/or identification of Escherichia coli 
and Shigella sp. 

for the detection and/or identification of Acinetobacter 
baumanii 

for the detection and/or identification of Pseudomonas 
aeruginosa 

for the detection and/or identification of Ciyptococcus sp. 
for the detection and/or identification of Legionella sp. 
for the detection and/or identification of Legionella 
pneumophila 

for the detection and/or identification of Mycoplasma 
pneumoniae 

for the detection and/or identification of Ciyptococcus 
neoformans 

for the detection and/or identification of Campylobacter 
jejuni and C. coli 

for the detection and/or identification of Bacteroides fi-agilis 
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2170 for the detection and/or identification ofAbiotrophia adiacens 

2171 for the detection and/or identification of Gemella sp. 

2172 for the detection and/or identification of Enterococcus sp., 

Gemella sp., A. adiacens 

2 1 80, 2 1 8 1 , 2 1 82 for the detection and/or identification ofBordetella pertussis, 

said method comprising the step of contacting the nucleic acids of the sample 
with said primers or probes under suitable conditions of hybridization or of 
amplification and detecting the presence of hybridized probes or amplified 
products as an indication of the presence of said specificalgal, archaeal, bacterial, 
fungal or parasitical species, genus, family or group. 

18. A method for the universal detection of any bacterium, fungus or 
parasite in a sample, using a panel of probes or amplification primers or both, 
each individual probe or primer being derived from a nucleic acid as defined in 
claims 8 or 9, the method comprising the step of contacting the nucleic acids of 
the sample with said primers or probes under suitable conditions ofhybridization 
or of amplification and detecting the presence of any alga, archaeon, bacterium, 
fungus or parasite. 



19. A method as set forth in claim 17 or 18, which further comprises 
probes or primers, or both, for the detection of at least one antimicrobial agent 
resistance gene. 

20. A method as set forth in claim 17, 18 or 19, which further comprises 
probes or primers, or both, for the detection of at least one toxin gene. 

21. A method as set forth in claim 19 or 20, wherein the probes or 
primers for the detection of said antimicrobial agent resistance gene or toxin gene 
have at least 12 nucleotides in length capable ofhybridizing with an antimicrobial 
agent resistance gene and/or toxin gene selected from SEQ ID NOs.: 

1078, 1079, 1085 for the detection and/or identification of the£. coli Shiga- 

like toxin 2 (stx 2 ) gene 
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1080, 1081, 1084, 
2012 

1082, 1083 

1086, 1087, 1088, 
1089, 1090, 1091, 
1092, 1170, 1239, 
1240, 2292 
1095, 1096, 1171, 
1241,2294, 2295 
1111, 1112, 1113, 
1114, 1115, 1116, 
1118, 1119, 1120, 
1121, 1123, 1124 
1103, 1104, 1109, 
1110 

1105, 1106, 1107, 
1108 

1097, 1098, 1099, 
1100, 1101, 1102 
1150, 1153, 1154, 
1155 

1094, 1125, 1126, 
1127, 1128, 1129, 
1130, 1131, 1132, 
1133, 1134, 1135, 
1192, 1193, 1194, 
1195, 1196, 1197, 
1214, 1216, 1217, 
1218, 1219, 1220, 
2015, 2016, 2017, 
2018, 2019, 2020, 
2021, 2022, 2023, 
2024, 2025, 2026, 
2027, 2028, 2029, 
2030, 2031, 2032, 
2033, 2034, 2035, 
2036, 2037, 2038, 
2039 



for the detection and/or identification of the E. coli Shiga- 
like toxin 1 (stxj) gene 

for the detection and/or identification ofE. coli Shiga-like 
toxins 1 and 2 (stx) genes 

for the detection and/or identification of HhevanA resistance 
gene 



for the detection and/or identification of thevanB resistance 
gene 

for the detection and/or identification of the vanAB 
resistance genes 



for the detection and/or identification of the vanCl 
resistance gene 

for the detection and/or identification of the vanC2 and 
vanC3 resistance genes 

for the detection and/or identification of the vanCl, vanC2 
and vanC3 resistance genes 

for the detection and/or identification of the vanAXY 
resistance genes 

for the detection and/or identification of the S. pneumoniae 
pbpla gene 
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1142,1143, 1144, 
1145 

1146, 1147, 1148, 
1149 

1177, 1231 

1290, 1291, 1292, 
1293, 1294, 1295, 
1296, 1297, 1298, 
1333, 1334, 1335, 
1340, 1341, 1936, 
1937, 1940, 1942, 
1943, 1945, 1946, 
1947, 1948, 1949, 
2040, 2041, 2042, 
2043,2250, 2251 
1301, 1302, 1303, 
1304, 1305, 1306 
1308, 1309, 1310, 
1311, 1312, 1313, 
1314, 1315, 1316, 
1317, 1318, 1319, 
1336, 1337, 1338, 
1339, 1342, 1343, 
1934, 1935, 1938, 
1939, 1941, 1944, 
1950, 1951, 1952, 
1953, 1955, 2044, 
2045, 2046 
1322, 1323, 1324, 
1325, 1326, 1327 
1344, 1345, 1346, 
1347 

1349, 1350 

1352, 1353, 1354, 
1355 

1357, 1358, 1359, 
1360 

1362, 1363, 1364, 
1365 



for the detection and/or identification of theS. pneumoniae 
pbp2b gene 

for the detection and/or identification of theS. pneumoniae 
pbp2x gene 

for the detection and/or identification of themec^f resistance 
gene 

for the detection and/or identification of ihegyrA resistance 
gene 



for the detection and/or identification of thegyrB resistance 
gene 

for the detection and/or identification of theparC resistance 
gene 



for the detection and/or identification of HheparE resistance 
gene 

for the detection and/or identification of the aac(2')-Ia 
resistance gene 

for the detection and/or identification of the aac(3')-Ib 
resistance gene 

for the detection and/or identification of the aac(3')-IIb 
resistance gene 

for the detection and/or identification of the aac(3')-IVa 
resistance gene 

for the detection and/or identification of the aac(3')-VIa 
resistance gene 
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1367, 1368, 1369, for the detection and/or identification of the aac(6')-Ia 

1370 resistance gene 

1372, 1373, 1374, for the detection and/or identification of the aac(6')-lc 

1375 resistance gene 

1377, 1378, 1379, for the detection and/or identification of the ant(3')-Ia 

1380 resistance gene 

1382, 1383, 1384, for the detection and/or identification of the ant(4')-Ia 

1385 resistance gene 

1387, 1388, 1389, for the detection and/or identification of the aph(3')-Ia 

1390 resistance gene 

1392, 1393, 1394, for the detection and/or identification of the aph(3')-IIa 

1395 resistance gene 

1397, 1398, 1399, for the detection and/or identification of the aph(3')-IIIa 

1400 resistance gene 

1402, 1403, 1404, for the detection and/or identification of the aph(3')-VIa 

1405, 2252 resistance gene 

1407, 1408, 1409 for the detection and/or identification of the blaCARB 

1410 resistance gene 

1412, 1413, 1414, for the detection and/or identification of the blaCMY-2 

1415 resistance gene 

1417, 1418 for the detection and/or identification of the blaCTX-M- 

7 and blaCTX-M-2 resistance genes 

1419, 1420, 1421, for the detection and/or identification of the blaCTX-M-1 

1422 resistance gene 

1424, 1425, 1426, for the detection and/or identification of the blaCTX-M-2 

1427 resistance gene 

1429,1430,1431, for the detection and/or identification of the blaIMP 

1432 resistance gene 

1434, 1435 for the detection and/or identification of the blaOXA2 

resistance gene 

1436, 1437 for the detection and/or identification of the blaOXAW 

resistance gene 

1440, 1441 for the detection and/or identification of the blaPER-1 

resistance gene 
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1443, 1444 


for the detection and/or identification of the blaPER-2 




resistance gene 


1446, 1447, 1448, 


for the detection and/or identification of the blaPER-1 and 


1449 


blaPER -2 resistance genes 


1450, 1451 


for the detection and/or identification of tkiedfrA resistance 




gene 


1453, 1454, 1455, 


for the detection and/or identification of the dhjrla and 


1456 


dhfrXV resistance genes 


1457, 1458, 1459, 


for the detection and/or identification of the dhfrla 


1460, 2253 


resistance gene 


1462, 1463, 1464, 


for the detection and/or identification of the dhfrlb and 


1465 


dhfrV resistance genes 


1466, 1467, 1468, 


for the detection and/or identification of the dhfrlb 


1469 


resistance gene 


1471, 1472, 1473, 


for the detection and/or identification of the <#i/rFresistance 


1474 


gene 


1476, 1477, 1478, 


for the detection and/or identification of the dhfrVI 


1479 


resistance gene 


1481, 1482, 1483, 


for the detection and/or identification of the dhfrVJI and 


1484 


dhfrXVII resistance genes 


1485, 1486, 1487, 


for the detection and/or identification of the dhjrVII 


1488 


resistance gene 


1490, 1491, 1492, 


for the detection and/or identification of the dhfrVIII 


1493 


resistance gene 


1495, 1496, 1497, 


for the detection and/or identification of the dhfrIX 


1498 


resistance gene 


1500, 1501, 1502, 


for the detection and/or identification of the dhfrXII 


1503 


resistance gene 


1505, 1506 


for the detection and/or identification of the dhfrXHI 




resistance gene 


1508, 1509, 1510, 


for the detection and/or identification of the dhfrXV 


1511 


resistance gene 


1513, 1514, 1515, 


for the detection and/or identification of the dhfrXVII 


1516 


resistance gene 
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1528, 1529 for the detection and/or identification ofiheereAand ereA2 

resistance genes 

1531, 1532, 1533, for the detection and/or identification of iheereB resistance 

1534 gene 

1536, 1537, 1538, for the detection and/or identification of the/m^4 and linA' 

1539 resistance genes 

1541, 1542, 1543, for the detection and/or identification of thelinB resistance 

1544 gene 

1546, 1547 for the detection and/or identification of Xh&mefA resistance 

gene 

1549, 1550 for the detection and/or identification of theme/E resistance 

gene 

1552, 1553, 1554, for the detection and/or identification of thsmejA and mefE 

1555 resistance genes 

1556, 1557, 1558, for the detection and/or identification of the mphA and 

1 559 mphK resistance genes 

1581, 1582, 1583, for the detection and/or identification of ih&satG resistance 

1584 gene 

1586, 1587, 1588, for the detection and/or identification of the tetM resistance 

1589,2254 gene 

1591, 1592, 1593, for the detection and/or identification of thzvanD resistance 

2297 gene 

1595, 1596, 1597, for the detection and/or identification of ihevanE resistance 

1598 gene 

1609, 1610, 1611, for the detection and/or identification of ihevatB resistance 

1612 gene 

1614, 1615, 1616, for the detection and/or identification of ihevatC resistance 

1617 gene 

1619, 1620, 1621, for the detection and/or identification of the vga resistance 

1622 gene 

1624, 1625, 1626, for the detection and/or identification of thev^oS resistance 

1627 gene 

1629, 1630, 1631, for the detection and/or identification of the vgb and vgh 

1632 resistance genes 
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1634, 1635, 1636, 
1637 

1883, 1884, 1885, 
1886, 1887, 1888, 
1889, 1890, 1891, 
1892, 1893, 1894, 
1895, 1896, 1897, 
1898 

1906, 1907, 1908, 
1909, 1910, 1911, 
1912, 1913, 1914, 
1915, 1916, 1917, 
1918, 1919, 1920, 
1921, 1922, 1923, 
1924, 1925, 1926, 
2006, 2007, 2008, 
2009, 2141 
1961, 1962, 1963, 
1964 

1966, 1967, 1968, 
1969 

2065, 2066, 2067, 
2068, 2069, 2070, 
2071 

2098, 2099, 2100 

2102,2103,2104 

2123,2124,2125 

2126,2127,2128 

2142, 2143 

2145,2146 

2148, 2149 



for the detection and/or identification of thevgbB resistance 
gene 

for the detection and/or identification of the blaSHV 
resistance gene 



for the detection and/or identification of the blaTEM 
resistance gene 



for the detection and/or identification of thesulll resistance 
gene 

for the detection and/or identification of ihetetB resistance 
gene 

for the detection and/or identification of therpoB resistance 
gene 

for the detection and/or identification of iheinhA resistance 
gene 

for the detection and/or identification of theembB resistance 
gene 

for the detection and/or identification of theC. difficile cdtA 
toxin gene 

for the detection and/or identification of theC. difficile cdtB 
toxin gene 

for the detection and/or identification of the mupA 
resistance gene 

for the detection and/or identification of the cat! resistance 
gene 

for the detection and/or identification of ihecatll resistance 
gene 
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2151,2152 for the detection and/or identification of thecatlll resistance 

gene 

2154, 2155 for the detection and/or identification of thecatP resistance 

gene 

2157, 2158, 2160, for the detection and/or identification of the cat resistance 
2161 gene 

2163,2164 for the detection and/or identification of the ppflo-Wkz 

resistance gene. 



22. A composition of matter comprising a specific nucleic acid as set 
forth in claim 10 or 11, which is specific for a bacterial, fungal or parasitical 
species, genus, family, or group, or a nucleic acid as set forth in claim 8 or 9 
which is universal for a bacterium, fungus or parasite, or both specific and 
universal nucleic acids, in conjunction with a nucleic acid sequence of at least 12 
nucleotides capable of hybridizing with an antimicrobial agent resistance gene 
and/or toxin gene. 

23. A composition as set forth in claim 22, wherein the nucleic acid 
capable of hybridizing with an antimicrobial agent resistance gene and/or toxin 
gene is any one of: 

1078, 1079, 1085 for the detection and/or identification of the is. coli Shiga- 

like toxin 2 (stx 2 ) gene 
1080, 1081, 1084, for the detection and/or identification of the£. coli Shiga- 
2012 like toxin 1 (stxj) gene 

1082, 1083 for the detection and/or identification ofE. coli Shiga-like 

toxins 1 and 2 (stx) genes 
1086, 1087, 1088, for the detection and/or identification of thevanA resistance 
1089, 1090, 1091, gene 
1092, 1170, 1239, 
1240, 2292 

1095, 1096, 1171, for the detection and/or identification of thevanB resistance 
1241,2294,2295 gene 

1111, 1112, 1113, for the detection and/or identification of the vanAB 
1114, 1115, 1116, resistance genes 
1118, 1119, 1120, 
1121, 1123, 1124 
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1103, 1104, 1109, 
1110 

1105, 1106, 1107, 
1108 

1097, 1098, 1099, 
1100, 1101, 1102 
1150, 1153, 1154, 
1155 

1094, 1125, 1126, 
1127, 1128, 1129, 
1130, 1131, 1132, 
1133, 1134, 1135, 
1192, 1193, 1194, 
1195, 1196, 1197, 
1214, 1216, 1217, 
1218, 1219, 1220, 
2015, 2016, 2017, 
2018, 2019, 2020, 
2021, 2022, 2023, 
2024, 2025, 2026, 
2027, 2028, 2029, 
2030, 2031,2032, 
2033, 2034, 2035, 
2036, 2037, 2038, 
2039 

1142, 1143, 1144, 
1145 

1146, 1147, 1148, 
1149 

1177, 1231 

1290, 1291, 1292, 
1293, 1294, 1295, 
1296, 1297, 1298, 
1333, 1334, 1335, 
1340, 1341, 1936, 
1937, 1940, 1942, 
1943, 1945, 1946, 
1947, 1948, 1949, 
2040, 2041, 2042, 
2043,2250, 2251 



for the detection and/or identification of the vanCl 
resistance gene 

for the detection and/or identification of the vanC2 and 
vanC3 resistance genes 

for the detection and/or identification of ihevanCJ, vanC2 
and vanC3 resistance genes 

for the detection and/or identification of the vanAXY 
resistance genes 

for the detection and/or identification of theS. pneumoniae 
pbpla gene 



for the detection and/or identification of the & pneumoniae 
pbp2b gene 

for the detection and/or identification of theS. pneumoniae 
pbp2x gene 

for the detection and/or identification of themecA resistance 
gene 

for the detection and/or identification of ihegyrA resistance 
gene 



368 

SUBSTITUTE SHEET (RULE 26) 



WO 01/2J604 



1301, 1302, 1303, 
1304, 1305, 1306 
1308, 1309, 1310, 
1311, 1312, 1313, 
1314, 1315, 1316, 
1317, 1318, 1319, 
1336, 1337, 1338, 
1339, 1342, 1343, 
1934, 1935, 1938, 
1939, 1941,1944, 
1950, 1951, 1952, 
1953, 1955, 2044, 
2045, 2046 
1322, 1323, 1324, 
1325, 1326, 1327 
1344, 1345, 1346, 
1347 

1349, 1350 

1352, 1353, 1354, 
1355 

1357, 1358, 1359, 
1360 

1362, 1363, 1364, 
1365 

1367, 1368, 1369, 
1370 

1372, 1373, 1374, 
1375 

1377, 1378, 1379, 
1380 

1382, 1383, 1384, 
1385 

1387, 1388, 1389, 
1390 

1392, 1393, 1394, 
1395 

1397, 1398, 1399, 
1400 
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for the detection and/or identification of thegyrB resistance 
gene 

for the detection and/or identification of iheparC resistance 
gene 



for the detection and/or identification of ih&parE resistance 
gene 

for the detection and/or identification of the aac(2')-Ia 
resistance gene 

for the detection and/or identification of the aac(3')-Ib 
resistance gene 

for the detection and/or identification of the aac(3')-IIb 
resistance gene 

for the detection and/or identification of the aac(3')-IVa 
resistance gene 

for the detection and/or identification of the aac(3')-VIa 
resistance gene 

for the detection and/or identification of the aac(6')-Ia 
resistance gene 

for the detection and/or identification of the aac(6')-Ic 
resistance gene 

for the detection and/or identification of the ant(3')-Ia 
resistance gene 

for the detection and/or identification of the ant(4')-Ia 
resistance gene 

for the detection and/or identification of the aph(3')-Ia 
resistance gene 

for the detection and/or identification of the aph(3')-IIa 
resistance gene 

for the detection and/or identification of the aph(3')-IIIa 
resistance gene 
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1402, 1403, 1404, for the detection and/or identification of the aph(3')-VIa 
1405, 2252 resistance gene 

1407, 1408, 1409 for the detection and/or identification of the blaCARB 
1410 resistance gene 

1412, 1413, 1414, for the detection and/or identification of the blaCMY-2 
1415 resistance gene 

1417, 1418 for the detection and/or identification of the blaCTX-M- 

iand blaCTX-M-2 resistance genes 
1419, 1420, 1421, for the detection and/or identification of the blaCTX-M-1 
1422 resistance gene 

1424, 1425, 1426, for the detection and/or identification of the blaCTX-M-2 
I All resistance gene 

1429,1430,1431, for the detection and/or identification of the blaIMP 
1432 resistance gene 

1434, 1435 for the detection and/or identification of the blaOXA2 

resistance gene 

1436, 1437 for the detection and/or identification of the blaOXAlO 

resistance gene 

1440, 1441 for the detection and/or identification of the blaPER-1 

resistance gene 

1443, 1444 for the detection and/or identification of the blaPER-2 

resistance gene 

1446, 1447, 1448, for the detection and/or identification of the blaPER-1 and 
1449 blaPER -2 resistance genes 

1450, 1451 for the detection and/or identification of ih&dfrA resistance 

gene 

1453, 1454, 1455, for the detection and/or identification of the dhfrla and 
1456 dhfrXV resistance genes 

1457, 1458, 1459, for the detection and/or identification of the dhfrla 
1460, 2253 resistance gene 

1462, 1463, 1464, for the detection and/or identification of the dhfrlb and 
1465 dhfrV resistance genes 

1466, 1467, 1468, for the detection and/or identification of the dhfrlb 
1469 resistance gene 
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1471, 1472, 1473, for the detection and/or identification of the <i/z/rFresistance 
1474 gene 

1476, 1477, 1478, for the detection and/or identification of the dhfrVI 
1479 resistance gene 

1481, 1482, 1483, for the detection and/or identification of the dhjrVU and 
1 484 dhfrXVII resistance genes 

1485, 1486, 1487, for the detection and/or identification of the dhfrVU 
1488 resistance gene 

1490, 1491, 1492, for the detection and/or identification of the dhfi-VIII 
1493 resistance gene 

1495, 1496, 1497, for the detection and/or identification of the dhfrIX 
1498 resistance gene 

1500, 1501, 1502, for the detection and/or identification of the dhjrXlI 
1 5 03 resistance gene 

1505, 1506 for the detection and/or identification of the dhfrXIII 

resistance gene 

1508, 1509, 1510, for the detection and/or identification of the dhfrXV 
1511 resistance gene 

1513, 1514, 1515, for the detection and/or identification of the dhfrXVII 
1516 resistance gene 

1528, 1529 for the detection and/or identification of the errand ereA2 

resistance genes 

1531, 1532, 1533, for the detection and/or identification of theereB resistance 
1534 gene 

1536, 1537, 1538, for the detection and/or identification of thelinA and HnA' 
1539 resistance genes 

1541, 1542, 1543, for the detection and/or identification of th&linB resistance 
1544 gene 

1546, 1547 for the detection and/or identification of ihemefA resistance 

gene 

1549, 1550 for the detection and/or identification of ih&mefE resistance 

gene 

1552, 1553, 1554, for the detection and/or identification of ihemefA and mefE 
1555 resistance genes 
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1556, 1557, 1558, 
1559 

1581, 1582, 1583, 
1584 

1586, 1587, 1588, 
1589, 2254 
1591, 1592, 1593, 
2297 

1595, 1596, 1597, 
1598 

1609, 1610, 1611, 
1612 

1614, 1615, 1616, 
1617 

1619, 1620, 1621, 
1622 

1624, 1625., 1626, 
1627 

1629, 1630, 1631, 
1632 

1634, 1635, 1636, 
1637 

1883, 1884, 1885, 
1886, 1887, 1888, 
1889, 1890, 1891, 
1892, 1893, 1894, 
1895, 1896, 1897, 
1898 

1906, 1907, 1908, 
1909, 1910, 1911, 
1912, 1913, 1914, 
1915, 1916, 1917, 
1918, 1919, 1920, 
1921, 1922, 1923, 
1924, 1925, 1926, 
2006, 2007, 2008, 
2009, 2141 
1961, 1962, 1963, 
1964 



for the detection and/or identification of the mpJtA and 
mphK resistance genes 

for the detection and/or identification of thesatG resistance 
gene 

for the detection and/or identification of the tetM resistance 
gene 

for the detection and/or identification of thevanD resistance 
gene 

for the detection and/or identification of ihevanE resistance 
gene 

for the detection and/or identification of thevatB resistance 
gene 

for the detection and/or identification of thevatC resistance 
gene 

for the detection and/or identification of thevga resistance 
gene 

for the detection and/or identification of HhevgaB resistance 
gene 

for the detection and/or identification of they^ and vgh 
resistance genes 

for the detection and/or identification of thevgbB resistance 
gene 

for the detection and/or identification of the blaSHV 
resistance gene 



for the detection and/or identification of the blaTEM 
resistance gene 



for the detection and/or identification of ihesulll resistance 
gene 
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1966, 1967, 1968, for the detection and/or identification of thetetB resistance 
1969 gene 

2065, 2066, 2067, for the detection and/or identification of therpoB resistance 

2068,2069,2070, gene 

2071 

2098, 2099, 2100 for the detection and/or identification of iheinhA resistance 

gene 

2102, 2103, 2104 for the detection and/or identification of theembB resistance 

gene 

2123, 2124, 2125 for the detection and/or identification of theC difficile cdtA 

toxin gene 

2126, 2127, 2128 for the detection and/or identification of theC. difficile cdtB 

toxin gene 

2142, 2143 for the detection and/or identification of the mupA 

resistance gene 

2145, 2146 for the detection and/or identification of the c<z/7 resistance 

gene 

2148, 2149 for the detection and/or identification of thecatll resistance 

gene 

2151,2152 for the detection and/or identification of thecatlll resistance 

gene 

2154, 2155 for the detection and/or identification of ihecatP resistance 

gene 

2157, 2158, 2160, for the detection and/or identification of the cat resistance 
2161 gene 

2163,2164 for the detection and/or identification of the ppflo-X\ke, 

resistance gene. 

24. A nucleic acid having at least 12 nucleotides in length, capable of 
hybridizing with the nucleotide sequence of any one of thetuf sequences defined 
in SEQ ID NOs.: 1-73, 75-241, 399-457, 498-529, 612-618, 621-624, 675, 677, 
717-736, 779-792, 840-855, 865, 868-888, 897-910, 932, 967-989, 992, 1266- 
1287, 1518-1526, 1561-1575, 1578-1580, 1662-1664, 1666-1667, 1669-1670, 
1673-1683, 1685-1689, 1786-1843, 1874-1881, 1956-1960, 2183-2185, 2187- 
2188, 2193-2201, 2214-2249, 2255-2272. 
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25. A nucleic acid having at least 12 nucleotides in length, capable of 
hybridizing with the nucleotide sequence of any one of iheatpD sequences 
denned in SEQ ID NOs.: 242-270, 272-398, 458-497, 530-538, 663, 667, 673, 
674, 676, 678-680, 737-778, 827-832, 834-839, 856-862, 866-867, 889-896, 929- 
931, 941-966, 1245-1254, 1256-1265, 1527, 1576-1577, 1600-1604,1638-1647, 
1649-1660, 1671, 1684, 1844-1848, 1849-1865,2189-2192. 

26. A nucleic acid having at least 12 nucleotides in length, capable of 
hybridizing with the nucleotide sequence of any one of the recA sequences 
defined in SEQ ID NOs.: 990-991, 1003, 1288-1289, 1714, 1756-1763, 1866- 
1873 and 2202-2212. 

27. A nucleic acid having at least 12 nucleotides in length, capable of 
selectively hybridizing with the nucleotide sequence of any one of the 
antimicrobial agent resistance gene sequences defined in SEQ ID NOs.: 1004- 
1075, 1255, 1607-1608, 1648, 1764-1785,2013-2014, 2056-2064,2273-2280. 

28. The nucleic acid sequences of the nucleic acids of any one of claims 
24 to 27. 

29. The use of a nucleic acid having at least 12 nucleotides in length 
capable of hybridizing with the nucleic acids of any one of the antimicrobial 
agent resistance genes sequences defined in SEQ ID NOs.: 1004-1075, 1255, 
1607-1608, 1648, 1764-1785, 2013-2014, 2056-2064, 2273-2280 for the 
detection and identification of microbial species. 

30. The use of a nucleic acid having at least 12 nucleotides in length 
capable of hybridizing with the nucleic acids of any one of the toxin genes 
denned in SEQ ID NOs.: 1078-1085, 2012 and 2123 to 2128 for the detection and 
identification of microbial species. 

31. A repertory of hexA nucleic acids used for the detection and/or 
identification of Streptococcus pneumoniae, which repertory is created by 
amplifying 
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the nucleic acids of any streptococcal species with any combination ot 
primers SEQ ID NOs.: 1179, 1181 and 1182. 

32. A repertory as defined in claim 31, which comprises the nucleic 
acids having a nucleotide sequence defined in SEQ ID NOs.: 1 184 to 1 191. 

33. A repertory of nucleic acid sequences derived from the repertory of 
claim 3 1 or 32. 

34. A nucleic acid used for the specific and ubiquitous detection and for 
identification of Streptococcus pneumoniae, which is derived from the repertory 
of claim 31. 

35. A nucleic acid as set forth in claim 34 which has a nucleic acid 
sequence of at least 12 nucleotides capable of hybridizing with said any 
Streptococcus pneumoniae and with any one of SEQ ID NOs.: 1 184 to 1 187. 

36. A nucleic acid as set forth in claim 34, which has a nucleic acid 
sequence of at least 12 nucleotides capable of hybridizing with the nucleic acids 
of Streptococcus pneumoniae and with any one of the nucleic acids having SEQ 
ID NOs.: 1179, 1180, 1181, 1182. 

37. A peptide derived from the translation of the nucleic acids from the 
repertory obtained from the method of claim 1, 31 or 32, or of the nucleic acids 
defined in any one of claims 24 to 27, 35 and 36. 

38. A peptide sequence derived from the peptide of claim 37. 

39. A recombinant vector comprising a nucleic acid obtained from the 
method of claim 1, 31 or 32, or from the nucleic acids defined in any one of 
claims 24 to 27, 35 and 36. 

40. A recombinant vector as defined in claim 39 which is an expression 
vector. 
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41. A recombinant host cell comprising the recombinant vector defined 
in claim 39 or 40. 

42. The use of the nucleic acid sequences defined in claim 28 or 33, or 
obtained from the method of claim 2 and of the protein sequences deduced from 
said nucleic acid sequences, for the design of a therapeutic agent effective against 
said microorganisms. 

43. The use as defined in claim 42, wherein said therapeutic agent is an 
antimicrobial agent, a vaccine or a genie therapeutic agent. 

44. A method for identification of a microorganism in a test sample, 
comprising the steps of: 

a) obtaining a nucleic acid sequence for a tuf, atpD, and/or recA 
genes of said microorganisms, and 

b) comparing said nucleic acid sequence with the nucleic acid 
sequences of a bank as defined in claim 5, said repertory comprising a 
nucleic acid sequence obtained from the nucleic acids of said 
microorganism, whereby said microorganism is identified when said 
comparison results in a match between said sequences. 
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(1) GENERAL INFORMATION: 
(i) APPLICANTS': 

5 BERGERON, Michel G. \ 1145 des Erables, Quebec City, 

Quebec, Canada, G2K 1T8 
BOISSINOT, .Maurice \ 109 Jean-Bruchesi, St-Augustin- 

de-Desmaures, Quebec, Canada, G3A 2N2 
HULETSKY, Ann \ 1231 Av des Pins, Sillery, Quebec, 
10 ^Canada, G1S 4J3 

MENARD, Christian \ 1174 Rue du Pont, St -Lambert -de - 

Levis, Quebec, Canada, GOS 2W0 
OUELLETTE, Marc \ 1035 de Ploermel, Sillery, Quebec, 
Canada, G1S 3S1 . 
15 PICARD, Francois J, \ 1245 de la Sapiniere, Cap- 

Rouge, Quebec, Canada, G1Y 1A1 
ROY, Paul H. 2 , 28 Charles Garnier, Ldretteville, 
Quebec, Canada, G2A 2X8 

20 x : Canadian citizenship 

2 : American citizenship 

(ii) TITLE OF THE INVENTION: HIGHLY CONSERVED GENES AN THEIR 
USE TO GENERATE SPECIES- SPECIFIC, GENUS -SPECIFIC AND 
25 . . UNIVERSAL NUCLEIC ACID PROBES AND AMPLIFICATION PRIMERS TO 
RAPIDLY DETECT AND IDENTIFY ALGAL, ARCHAEAL, BACTERIAL, 
FUNGAL AND PARASITICAL MICROORGANISMS FROM CLINICAL 
SPECIMENS FOR DIAGNOSIS 

30 (iii) NUMBER OF SEQUENCES: 2297 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : 

35 (B) STREET: 

(C) CITY: 

(D) STATE : 

(E) COUNTRY: 

(F) ZIP: 



40 



(v) COMPUTER READABLE: 



(A) MEDIUM TYPE : 

(B) COMPUTER: 
45 (C) OPERATING: 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 

50 (A) APPLICATION: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

55 

(A) APPLICATION: 

(B) FILING DATE: 
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(viii) ATTORNEY/AGENT INFORMATION: 
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(B) REGISTRATION NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : 

(B) TELEFAX: 
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2) INFORMATION FOR SEQ ID NO: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobac ter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

CAAACTCGTG AGCACATCCT TCTTTCTCGT CAGGTAGGTG TACCTTACAT 50 

CATCGTATTC TTAAACAAAT GCGACCTTGT TGATGACGAA GAATTACTTG 100 

AATTAGTAGA AATGGAAGTA CGTGAACTTC TTTCTACTTA TGACTTCCCA 150 

GGTGATGACA CTCCAGTAAT CCGTGGTTCA GCTCTTGCAG CGCTTAACGG 200 

TGAAGCTGGT CCTTACGGTG AAGAATCAGT TCTTGCTCTT GTAGCAGCAC 250 

TTGACTCTTA CATCCCAGAG CCAGAGCGTG CAATCGACAA AGCATTCTTG 3 00 

ATGC CAATCG AAGACGTATT CTCAATTTCT GGT CGTGGTA CAGTAGTAAC 35 0 
AGGCCGTGTT GAAGCTGGTA TCATCAAAGT TGGTGAAGAA GTAGAGATCG • 400 

TTGGTATTAA AGATACAGTT AAAACAACTG TAACTGGCGT AGAAATGTTC 450 

CGTAAACTTC TTGACGAAGG CCGTGCAGGT GAGAACTGTG GTATCTTACT 500 

TCGTGGTACT . AAGCGTGAAG AAGTACAACG TGGT CAAGTA CTTGCTAAAC 550 

CAGGTACAAT CAAGCCGCAC ACTAAATTCG ACGCAGAAGT ATACGTACTT 600 

TCTAAAGAAG AAGGTGGTCG TCACACTCCA TTCTTAAATG GTTACCGTCC 650 

ACAGTTCTAC TTCCGTACAA CTGACGTAAC TGGTGCRATC CAGTTGAAAG 700 

AAGGCGTTGA AATGGTAATG CCAGGTGACA ACGTTGAAAT GTCAGTAGAA 750 



2) INFORMATION FOR SEQ ID NO : 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
{uT TUFuTutiyT " linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



CGGTGCGATC CTCGTGGTCG CCGCGACCGA CGGCCCCATG 
GCGAGCACGT CCTGCTCGCC CGTCAGGTCG GCGTTCCCAC 
GCCCTCAACA AGTCCGACAT GGTTGACGAC GAGGAAATGA 
CGAGGAGGAG TGCCGCGACC TGCTGGAGTC CCAGGACTTC 
CCCCGATCGT CCAGGTTTCC GCTCTGAAGG CCCTCGAGGG 
TGGGTTGCCA AGATCGAGGA GCTCATGGAG GCTGTGGATT 
CACCCCCGAG CGCGATATGG ACAAGCCCTT CCTCATGCCG 
TCTTCACGAT CACAGGTCGT GGCACGGTCG TCACGGGGCG 
GGCAAGCTGC CGATCAACTC CGAGGTCGAG ATCCTCGGTA 



GCCCAGACCC 
CATCCTCATC 
TGGAACTGGT 
GATCGCGATG 
CGACGCGGAG 
CCTACATCCC 
ATCGAGGACG 
TGTTGAGCGT 
TCCGTGATCC 
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CCAGAAGACC ACGGTCACCG GCATCGAGAT 
AGGCATGGGC CGGCGAGAAC TGTGGCCTGC 
GATGAGGTTG AGCGCGGCCA GGTTGTGGCC 
TCACACCGAG TTCGAGGGCC AGGTTTACAT 
5 GCCGTCACAA CCCGTTCTTC TCGAACTACC 
ACCACGGACG TGACCGGCGT CATCACCCTC 
CATGCCTGGC GACACCACCG AGATCTCCGT 
CCATGGAGCC CGGCTGGGCT TCGCCA 



GTTCCACAAG TCGATGGACG 500 

TGCTGCGCGG TACCAAGCGC 550 

ATTCCCGGCT CCATCACGCC 600 

CCTCAAGAAG GAAGAGGGCG 650 

GTCCGCAGTT CTACTTCCGT 70 0 

CCCGAGGGCA CCGACATGGT 750 

TCAGCTGATC CAGCCCATCG 800 

826 



2) INFORMATION FOR SEQ ID NO : 3 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aerococcus viridans 

(B) STRAIN: ATCC 11563 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 



TGGTGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC 50 

GTG AG CACAT CCTTTTAGCT GGCCAAATCG GTGTTCCTGC ATTCGTAGTA 100 

30 TTCTTAAACA AAGTTGACCA AGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA GTACAACTAC CCAGGTGACG 200 

ATCTACCTGT AATCGCTGGT TCTGCTTTAT TAGCATTACA AGGCGATGAA 250 

GCTCAAGAAG CTAAAATCAT GGAATTAATG GAAGCTGTAG ACTCTTACAT 300 

TCCAGAACCA GAACGTGACA ACGACAAACC ATT CATGATG CCAATTGAGG 3 50 

35 ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA " 4 00 

CGTGGTGAAG TTCGTACAGG TGACGAAGTT GACATCGTTG GTATTGCTGA 450 

ACAAATCGGT AAATCAGTTG TAACTGGTGT TGAAATGTTC CGTAAAAACT 500 

TAGACTACGC TCAAGCTGGT GACAACATCG GTGCATTATT ACGTGGTGTT 550 

CAACGTGAAG ACATCCAACG TGGTCAAGTA TTGGCTGCTC CTGGTTCAAT 600 

40 CACTCCACAT ACTAAATTTA AAGCGCAAGT TTACGTTTTA TCTAAAGAAG 650 

AAGGTGGACG TCATACACCA TTCTTAACTA ACTACCGTCC ACAATTCTAC 70 0 

TTeccrrj^i^T 'tsjSSSzxrS^I T<iu T i^ r i^ r cT al"i^^ : caL : luuiAi3&5&*u vmT ~ 

TATGGTTATG CCTGGTGACA ACGTTGATAT GGACGTTGAA TTGATTCACC 800 

CAGTTGCGAT CGAAGATGGT ACTAAATTCT CTATC 835 

45 



2) INFORMATION FOR SEQ ID NO : 4 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Achromobacter xylosoxldans subsp. 
60 denitrif icons 

4 
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(B) 



STRAIN: ATCC 15173 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 



CCTGGTGGTG 
TCCTGCTGAG 
AAGGCCGACA 
AGTTCGCGAR 
TCGTGAAGGG 
GGCGAACAGG 
GACGCCTGAG 
TGTTCTCGAT 
GG CATC AT CA 
GGTGAAGACG 
AAGGTCAAGC 
GAAGACGTCC 
GCACACGGAC 
GCCGTCACAC 
ACGACGGACG 
CCTGCCGGGC 
CCATGGAAGA 



TCGGCCGCTG 
CCGCCAGGTT 
TGGTTGACGA 
CTGCTGAGCA 
TTCGGCCAAG 
CCATCATGGC 
CGTGCCGTTG 
CTCGGGTCGC 
AGGTCGGCGA 
ACCTGCACGG 
CGGCGACAAC 
AGCGCGGCCA 
TTCACGTCCG 
TCCGTTCTTC 
TGACGGGCAC 
GACAACGTGG 
AGGCCTGCGT 



ACGGCCCGAT GCCGCAAACG CGCGAACACA 
GGCGTGCCGT ACATCATCGT CTTCCTGAAC 
CGCCGAGCTG CTTGAGCTGG TGGAAATGGA 
AGTACGACTT CCCGGGCGAC GACACCCCGA 
CTGGCGCTGG AAGGCGACAA GGGCGAACTG 
GCTGGCCGCT GCGCTGGACT CGTACATCCC 
ACGGCGCGTT CCTGATGCCG GTTGAAGACG 
GGCACCGTGG TGACCGGCCG TATCGAACGC 
GGAAAT CGAA ATCGTCGGTC TGGTGCCGAC 
GCGTGGAAAT GTTCCGCAAG CTGCTGGACC 
GTGGGCATCC TRCTGCGCGG CACCAAGCGT 
GGTTCTGGCC AAGCCGGGCT CGATCACCCC 
AGGTGTACAT CCTGTCCAAG GAAGAAGGCG 
CAAGGCTATC GTCCCCAGTT CTACTTCCGC 
GATCGAGCTG CCGGCCGACA AGGAAATGGT 
CCATGACGGT CAAGCTGCTG GCTCCGATCG 
TCGCCAC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
827 



2) INFORMATION FOR SEQ ID NO : 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Anaerorhabdus furcosus 

(B) STRAIN: ATCC 25662 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

TGGATCAATC CTAGTAGTTG CTGCAACTGA TGGACCAATG CCTCAAACTC 50 

GTGAACATAT CTTACTTGCT CGTCAAGTAG GTGTTC CAAG JU^TGGTTGTA 100 

TTCTTGAACA AATGCGACAT GGTTGAAGAT GAAGAATTAA TCGACCTTGT 150 

TGAAATGGAA GTTCGTGAAC TTCTAAGTGC TTACGGTTTC GAAGGTGATG 200 

ATACACCAGT TATCCGTGGT TCTGCATTAA AATCTCTTGA AGGAAATGCT 250 

GATTGGGAAG CAAAAGTTGC TGAATTAATG GATGCAGTTG ACT CTTGGAT 300 

TCCAACTCCA ACTCATGAAA CAGACAAACC ATTCTTAATG GCTGTTGAAG 350 

ATGTATTCAC AATTACAGGT CGTGGTACAG TTGCTACTGG ACGTGTTGAA 4 00 

CGTGGACACT TAAACCTTAA CGAAGAAGTT GAAATCGTTG GTATTCATGA 450 

TACTAAGAAA TCAGTTGTTA CTGGTATCGA AATGTTCCGT AAATTATTAG 500 

ACTATGCTGA AGCAGGAGAC AACATTGGTG CATTATTACG TGGTGTTTCT 550 

CGTGATGAAA TCGAACGTGG ACAATGTCTA GCTAAACCTG GAT CAGTTAC 6 00 

TCCACATACA GCTTTCAAAG CTCAAGTATA CGTATTAACT AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC GTAACTAACT ACCGTCCTCA ATTCTATTTC 700 

CGTACAACTG ACGTAACAGG AGTTGTTAAA CTTCCTGAAG GTACTGAAAT 750 

GGTTATGCCT GGAGACAACA TCGAAATGAT CGTTGAATTA ATCGCTCCAA 800 

TCGCTGTTGA ACAAGGAACT AAG 823 
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2 ) INFORMATION FOR SEQ ID NO : 6 

(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: 4229 

15 (xi > SEQUENCE DESCRIPTION: SEQ ID NO: 6 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACTC 50 

GTGAGCACAT CCTTCTTTCT CGTCAAGTAG GTGTACCTTA CATCGTTGTA 100 

TTCTTAAACA AATGCGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

20 AGAAATGGAA GTTCGCGACC TATTATCTGA ATACGGATTC CCAGGCGACG 200 

ACATTCCTGT AATCAAAGGT TCTGCTCTTA AAGCTCTTCA AGGAGAAGCT 250 

GATTGGGAAG CAAAAATCAT TGAATTAATG GCTGAAGTTG ATGCTTACAT 3 00 

CCCAACTCCA GAACGTGAAA CTGACAAACC ATTCTTAATG CCTGTAGAGG 35 0 

ACGTATTCTC TATGACAGGT CGTGGTACAG TTGCTACTGG TCGTGTTGAG 40 0 

25 CGCGGTATCG TTAAAGTTGG TGACGTAGTA GAAATCAT CG GTCTTGCTGA 45 0 

AG AAAATG CT TCTACAACTG TAACTGGTGT AGAGATGTTC CGTAAACTTC 50 0 

TTGACCAAGC TCAAGCTGGA GACAACATCG GTGCTTTACT TCGTGGGGTT 55 0 

GCTCGTGAAG ACATCCAACG TGGACAAGTA CTTGCAAAAA GCGGTTCTGT 500 

AAAAGCTCAC GCTAAATTCA AAGCTGAAGT TTTCGTATTA TCTAAAGAAG 650 

30 AAGGTGGACG TCACACTCCA TTCTTCGCTA ACTACCGTCC TCAGTTCTAC 70 0 

TTCCGTACAA CTGACGTAAC TGGTATCATC CAATTACCAG AAGGTACTGA 750 

AATGGTAATG C CTGGTGAC A ACAT CGAAAT GACTATCGAA CTTATCGCTC 80 0 

CAATCGCTAT CGAAGAGGGA ACTAA 825 



35 



50 



2 ) INFORMATION FOR SEQ ID NO : 7 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(D) TOPOLOGY: Linear 
45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACAC 50 

GTGAGCACAT CCTTCTTTCT CGTCAAGTAG GTGTTCCTTA CATCGTTGTA 100 

55 TTCTTAAACA AATGCGACAT GGTAGAT GAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGCGACC TATTATCTGA ATACGGATTC CCAGGCGACG 200 

ACATTCCTGT AATCAAAGGT TCTGCTCTTA AAGCTCTTCA AGGAGAAGCT 250 

GATTGGGAAG CAAAAATCAT TGAATTAATG GCTGAAGTTG ATGCTTACAT 300 

CCCAACTCCA GAACGTGAAA CTGACAAACC ATTCTTAATG CCTGTAGAGG 350 

60 ACGTATTCTC TATCACAGGT CGTGGTACAG TTGCTACTGG TCGTGTTGAG 400 
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CGCGGTATCG TTAAAGTTGG 
AGAAAATGCT TCTACAACTG 
TTGACCAAGC TCAAGCTGGA 
GCTCGTGAAG ACATCCAACG 
5 AAAAGCTCAC GCTAAATTCA 
AAGGTGGACG TCACACTCCA 
TTC CGTACAA CTGACGTAAC 
AATGGTAATG CCTGGTGACA 
CAATCGCTAT CGAAGAGGGA 



TGACGTAGTA GAAATCATCG 
TAACTGGTGT AGAGATGTTC 
GACAACATCG GTGCTTTACT 
TGGACAAGTA' CTTGCAAAAA 
AAGCTGAAGT TTTCGTATTA 
TTCTTCGCTA ACTACCGTCC 
TGGTATCATC CAATTACCAG 
ACATTGAAAT G ACT AT CG AA 
ACTAAATTC 



GTCTTGCTGA 450 

CGTAAACTTC 500 

TCGTGGGGTT 550 

GCGGTTCTGT 600 * 

TCTAAAGAAG 650 

TCAGTTCTAC 700 

AAGGTACTGA 750 

CTTATCGCTC 800 
829 



2 ) INFORMATION FOR SEQ ID NO : 8 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonls 
25 (B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

CGGTGCTATC ATCGTAGTTG CTGCTACTGA TGGTCCTATG CCTCAAACTC 50 

30 GCGAGCACAT CCTTTTGGCT CGTCAGGTAA ACGTTCCGAG ATTGGTTGTA 100 

TTCATGAACA AGTGTGACAT GGTTGACGAC • GAGGAAATGT TGGAATTGGT 150 

TGAGATGGAG ATGAGAGAGT TGCTTTCATT CTATCAATTC GACGGTGACA 20 0 

ACACTCCGAT CATCCGTGGT TCTGCTCTTG GTGCATTGAA CGGTGATGCT 250 

CAATGGGAAG ATAAAGTAAT GGAGTTGATG GAAGCTTGTG ATACTTGGAT 30 0 

35 TCCTCTGCCT CCGCGCGAAA TCGACAAGCC GTTCTTGATG CCGGTTGAGG 350 

ACGTATTCTC AATCACGGGT CGTGGTACTG TTGCTACAGG TCGTATCGAG 400 

ACAGGTATTG TTAAGGTTGG TGAGGAAGTT CAGATCATCG GTCTTGGCGC 450 

TGCTGGTAAG AAATCTGTTG TTACAGGTGT TGAGATGTTC CGTAAGTTAT 50 0 

TGGATCAAGG TGAGGCTGGT GATAACGTTG GTTTGTTGCT TCGCGGTATC 550 

40 GATAAGAATG AGATCAAGCG TGGTATGGTA ATCTGCCACC CGGGTCAGGT 60 0 

TAAAGAGCAT TCTAAGTTCA AGGCTGAGGT TTATATCTTG AAGAAAGAGG 650 

AAGGTGGTCG TCACACTCCG TTCCACAACA AATATC^TCC TCAGTTCTAT ~" 70~0 " 

ATCCGTACAT TGGATGTAAC TGGTGAGATC ACTTTGCCGG AAGGAACTGA 750 

AATGGTAATG CCGGGTGATA ACGTAACGAT CGAGGTTGAG TTGATCTATC 80 0 

45 CGGTAGCATG TAGCGTAG 818 



2 ) INFORMATION FOR SEQ ID NO : 9 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
60 (A) ORGANISM: Enterococcus casseliflavus 

7 
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(B) STRAIN: R763 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

5 GGTCCTATGC CTCAAACACG TGAACACATC TTGTTATCAC GTAACGTTGG 50 

TGTAC CATAC ATCGTTGTTT TCTTAAACAA AATGGATATG GTTGATGACG 10 0 

AAGAATTACT AGAATTAGTT GAAATGGAAG TTCGTGACTT ATTGTCAGAA 150 

TATGACTTCC CAGGCGACGA TGTTCCTGTA ATCGCTGGTT CTGCTTTGAA 200 

AGCTCTTGAA GGCGATGCTT CATACGAAGA AAAAATCATG GAATTAATGG 250 

10 CTGCAGTTGA CGAATACGTT CCAACTCCAG AACGTGACAC TGACAAACCA 3 00 

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATCACTGGAC GTGGTACTGT 3 50 

TGCTACAGGC CGTGTTGAAC GTGGACAAGT TCGCGTTGGT GACGAAGTTG 400 

AAATCGTTGG TATTGCTGAA GAAACTGCTA AAACAACTGT AACTGGTGTT 450 

GAAATGTTCC GTAAATTGTT AGACTATGCT GAAGCAGGGG ATAACATTGG 500 

15 TGCATTGCTA CGTGGTGTTG CTCGTGAAGA CATC CAACGT GGACAAGTAT 550 

TGGCTAAAGC TGGTACAATC ACACCTCATA CAAAATTTAA AGCTGAAGTT 600 

TACGTTTTAA CAAAAGAAGA AGGTGGACGT CACACACCA 639 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 10 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 692 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG 19 7 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

GAACACATTC TTTTATCACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT 50 

CTTAAACAAA GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG 100 

AAATGGAAGT TCGTGACTTA TTAAGCGAAT ATGACTTCCC AGGTGACGAT 150 

40 GTACCTGTAA TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA 20 0 

CTATGAGCAA AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC 250 

CAACACCAGA ACGTGATTCT GACAAACCAT TCATGATGCC AGTTGAGGAC 300 

GTATTCTCAA TCACTGGTCG TGGTACTGTT GCTACAGGCC GTGTTGAACG 350 

TGGT CAAATC AAAGTCGGTG AAGAAAT CGA AATCATCGGT ATGCAAGAAG . 400 

45 AATCAAGCAA AACAACTGTT ACTGGTGTAG AAATGTTC CG TAAATTATTA 450 

GACTACGCTG AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTTC 500 

ACGTGATGAC GTACAACGTG GTCAAGTTTT AGCTGCTCCT GGTACTATTA 550 

CACCACATAC AAAATTCAAA GCGGATGTTT ACGTTTTATC TAAAGATGAA 6 00 

GGTGGTCGTC ATACACCATT CTTCACTAAC TACCGCCCAC AATTCTATTT 650 

50 CCGTACTACT GACGTAACTG GTGTTGTTAA CTTACCAGAA GG 692 



55 



2) INFORMATION FOR SEQ ID NO: 11 



( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Bacteroldes ovatus 

(B) STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

10 CGGTGCTATC ATCGTTTGTG CTGCAACTGA TGGTCCGATG CCTCAAACTC 50 

GCGAACACAT TCTGTTAGCT CGTCAGGTAA ACGTACCTCG TCTGGTTGTA 10 0 

TTCTTGAACA AATGCGATAT GGTAGACGAC GAAGAAATGT TGGAACTCGT 150 

TGAAATGGAA ATGAGAGAAC TCCTTTCATT CTATGATTTC GATGGTGACA 200 

ATACTCCTAT CATCCGTGGT TCTGCTCTTG GCGCATTGAA CGGTGTTGAA 250 

15 AAATGGGAAG ACAAAGTTAT GGAACTGATG GATGCAGTTG ATAACTGGAT 3 00 

TCCACTGCCT CCGCGCGATG TTGATAAACC ATTCTTGATG CCGGTTGAAG 350 

ACGTGTTCTC TATCACAGGT CGTGGTACTG TAGCAACAGG TCGTATCGAA 4 00 

ACAGGTGTCA TCCACGTTGG TGATGAAGTC GAAATTCTTG GTTTAGGTGA 450 

AGATAAGAAA TCAGTTGTAA CTGGTGTTGA AATGTTCCGT AAACTGTTGG 5 00 

20 ATCAAGGTGA AGCTGGTGAC AACGTAGGTC TTTTGCTTCG TGGTATTGAC 550 

AAGAACGAAA TCAAACGTGG TATGGTTCTT TGTAAACCAG GTCAGATTAA 600 

ACCGCACTCT AAATTCAAAG CTGAGGTTTA TATCTTGAAG AAAGAAGAAG 650 

GTGGTCGTCA CACTCCGTTC CACAACAAAT ACCGTCCTCA GTTCTACTTG 700 

CGTACTATGG ACTGTACAGG TGAAATCACT TTGCCGGAAG GAACAGAAAT 750 

25 GGTAATGCCG GGTGATAACG TAACTATTAC AGTTGAGTTG ATTTACCCAG 800 

TAGCATTGAA CCCGGGCTTC G 821 



30 2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bartonella henselae 
(B) STRAIN :" " ATCC" 4 9882* " 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



TGGTGCGATT TTGGTTGTTT CAGCTGCTGA TGGTCCGATG CCTCAAACAC 50 

GTGAGCATAT TCTTCTTGCC CGTCAGGTTG GTGTTCCAGC GATTGTTGTT 100 

TTTCTTAATA AGGTTGATCA GGTTGATGAT GCTGAGCTTT TGGAGCTTGT 150 

TGAGCTTGAA GTTCGGGAGT TATTGTCGAA ATATGATTTT CCAGGAGACG 20 0 

50 ATATTCCGAT CGTTAAAGGT TCTGCTTTGG CAGCGCTTGA AGATAAAGAT 250 

AAAAGCATTG GTGAAGATGC GGTTCGTCTT TTGATGAGTG AAGTTGATAA 300 

TTATATACCG ACGCCTGAAC GTCCTGTTGA TCAGCCGTTT TTGATGCCAA 35 0 

TTGAAGATGT TTTTTCGATT TCGGGTCGTG GAACTGTTGT GACGGGTCGT 400 

GTTGAGCGTG GTGTTATTAA GGTTGGTGAA GAAGTTGAGA TTATCGGCAT 45 0 

55 TCGTCCAACT TCTAAGACAA CAGTTACAGG GGTTGAAATG TTCCGCAAGC 50 0 

TTTTAGATCA GGGGCAAGCG GGTGATAATA TTGGAGCGCT GCTTCGTGGT 550 

ATTGATCGTG AAGGGATTGA GCGTGGACAA GTTTTGGCGA AGCCTGCTTC 600 

GGTTACACCT CATACGAGAT TTAAAGCAGA GGCTTACATT TTGACGAAAG 650 

ATGAAGGTGG TCGTCATACT CCATTTTTCA CGAATTATCG TCCTCAGTTT 700 

60 TATTTCCGTA CTACGGATGT AACGGGAATT GTTACGCTTC CAGAAGGTAC 750 
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AGAGATGGTT ATGCCTGGTG ATAATGTTGC TATGGATGTC TCTCTGATTG 8 00 

TTCCAATTGC CATGGAAGAA AAACTTCGTT TTGCTATC 838 



2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentis 

(B) STRAIN: ATCC 15703 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

TGGCGCCATC CTTGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50 

GCGAGCACGT GCTGCTCGCT CGTCAGGTGG GCGTCCCGAA GATCCTCGTC 100 

GCTCTGAACA AGTGCGATAT GGTCGACGAC GACGAGCTCA TCGAGCTCGT 150 

TGAGGAAGAG GTCCGTGACC TCCTCGACGA AAATGGCTTC GATCGCGATT 200 

GCCCGGTCAT CCACGTGTCC GCTTACGGCG CACTGCACGA TGACGCTCCG 250 

GACCACGAGA AGTGGGTTGA GCAGATCAAG AAGCTCATGG ACGCCGTCGA 3 00 

TGACTACATC CCGACCCCGG TCCACGATCT GGACAAGCCG TTCCTGATGC 3 50 

CGATCGAAGA TGTCTTCACC ATCTCCGGCC GTGGCACCGT GGTGACCGGC 4 00 

CGTGTCGAGC GTGGTAAGCT CCCGGTCAAC TCCAACGTCG AGATCGTCGG 450 

CATCCGTCCG ACCCAGACCA CCACCGTCAC CTCCAT CGAG ACCTTCCACA 500 

AGCAGATGGA CGAGTGCGAG GCTGGCGACA ACACCGGTCT GCTGCTCCGC 550 

GGCATCAACC GTGACCAGGT CGAGCGTGGC CAGGTTCTGG CTGCTCCGGG 600 

CTCCGTGACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650 

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GTACCACCGA CGTCACCGGC GTCATCACCC TGCCGGAAGG 750 

CGTTGAGATG GTGCAGCCGG GCGATCACGC TACCTTCGGC GTTGAGCTGA 800 

TCCAGCCGAT CGCTATGGAA GAGGGCCTGA CCTTCGCAG 839 



2 ) INFORMATION FOR SEQ ID NO: 14 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium dentium 

(B) STRAIN: ATCC 27534 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50 

GCGAGCACGT GCTGCTCGCT CGTCAGGTGG GCGTGCCGCG TATCCTCGTC 100 

GCCCTGAACA AGTGCGATAT GGTCGACGAC GAAGAGCTCA TCGAGCTCGT 150 
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TGAGGAAGAG GTCCGTGACC TCCTCGACGA 
GCCCGGTCAT CCACACCTCC GCCTACGGCG 
GACCACGACA AGTGGGTTGA GTCCGTCAAG 
CGAGTACATC CCGACCCCGA CCCACGATCT 
5 CGATCGAAGA TGTGTTCACC ATCTCCGGCC 
CGTGTCGAGC GTGGTAAGCT CCCGGTCAAC 
CATCCGTCCG ACCCAGACCA CCACCGTCAC 
AGCAGATGGA CGAGTGCGAG GCTGGCGACA 
GGCATCAACC GTGACCAGGT CGAGCGTGGC 
10 CTCCGTGACC CCGCACACCA AGTTCGAGGG 
AGGACGAAGG CGGCCGTCAC TCGCCGTTCT 
TTCTACTTCC GTACCACCGA CGTCACCGGC 
CGTTGAGATG GTGCAGCCGG GCGATCACGC 
TCCAGCCGAT CGCTATGGAA GAGGGCCTGA 

15 



AAACGGCTTC GATCGCGATT 200 

CGCTGCACGA TGACGCTCCG 250 

GAACTCATGA AGGCCGTCGA 3 00 

GGACAAGCCG TTCCTGATGC 350 

GTGGCACCGT GGTTACCGGC 4 00 

TCCAACGTTG AGATCGTCGG 4 50 

CTCCATCGAG ACCTTCCACA 500 

ACACCGGTCT GCTGCTCCGC 550 

CAGGTTCTGG CTGCTCCGGG 600 

CGAAGT CTAC GTGCTGACCA 650 

TCTCCAACTA CCGTCCGCAG 7 00 

GTCATCACCC TGCCGGAAGG 750 

TACCTTCGGC GTTGAGCTGA 8 00 

CCTTCGCAG 839 



2) INFORMATION FOR SEQ ID NO: 15 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Brucella, abortus 
30 (B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

TGGCGCGATC CTGGTGGTTT CGGCTGCTGA CGGCCCGATG CCGCAGACCC 50 
35 GCGAGCACAT CCTGCTTGCC CGTCAGGTTG GCGTTCCGGC GATCGTCGTG 100 
TTCCTCAACA AGTGCGACCA GGTTGACGAT GCAGAACTGC TCGAACTGGT 150 
TGAACTGGAA GTGCGCGAAC TTCTGTCGAA GTACGAATTC CCCGGCGACG 200 
AAATCCCGAT CAT CAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCC 250 
AAGGAACTGG GCGAAGATGC CATCCGCAAC CTGATGGACG CGGTTGACAG 3 00 

40 CTACATTCCG ACCCCGGAAC GCCCGATCGA CCAGCCGTTC CTGATGC CGA 35 0 

TCGAAGACGT GTTCTCGATC TCCGGCCGTG GTACGGTTGT GACGGGTCGC 400 

UTTUAtiCticCJ CiTATCCiTTAA CiUU-C^UXiAA U/^^l^GAAA TCGuTJ^CAU 7 4b'u"* 

CAAGGCGACG ACGAAGACCA CGGTTACCGG CGTTGAAATG TTCCGCAAGC 50 0 

TGCTCGACCA GGGCCAGGCT GGCGACAACA TTGGCGCGCT GATCCGCGGC 550 

45 GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGGGTTC 600 

TGTGAAGCCG CACACCAAGT TTAAGGCAGA AGCCTATATT CTGACCAAGG 650 

ACGAAGGTGG CCGTCATACG CCGTTCTTCA CCAACTACCG TCCGCAGTTC 700 

TACTTCCGTA CGACGGACGT GACGGGTGTT GTGACGCTTC CGGCTGGCAC 750 

GGAAATGGTC ATGCCTGGCG ATAACGTCGC CATGGACGTT ACCCTGATCG 800 

50 TGCCGATCGC CATGGAAGAG AAGCTTCGCT TCGCTATC 83 8 



2) INFORMATION FOR SEQ ID NO: 16 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buxrkholderia cepacia. 
<B) STRAIN: LSPQ 2217 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 



10 GGCAGCAGAC 
GTCAGGTTGG 
GTGGACGACG 
CCTGTCGAAG 
CGGCGAAGCT 

15 ATCATGAGCC 
TGCAGTTGAC 
CGGGCCGCGG 
GTCGGCGAAG 
CTGCACGGGC 

20 GCGACAACGT 
CGTGGCCAGG 
CACGGCTGAA 
CGTTCTTCAA 
ACGGGCTCGA 

25 CAACGTGTCG 



GGCCCGATGC 
CGTTCCGTAC 
CCGAACTGCT 
TACGACTTCC 
GGCGCTGGAA 
TGGCCGACGC 
GGCGCGTTCC 
TACGGTGGTG 
AAATCGAAAT 
GTTGAAATGT 
TGGTATCCTG 
TTCTGGCGAA 
GTGTACGTGC 
CAACTACCGT 
TCGAGCTGCC 
ATCACGGTGA 



CGCAAACGCG 
ATCATCGTGT 
CGAGCTGGTC 
CGGGCGACGA 
GGCGACACGG 
GCTGGACACG 
TGATGCCGGT 
ACGGGTCGTG 
CGTCGGTATC 
TCCGCAAGCT 
CTGCGCGGCA 
GCCGGGTTCG 
TGAGCAAGGA 
CCGCAGTTCT 
GAAGGACAAG 
A 



TGAGCACATC 
TCCTGAACAA 
GAGATGGAAG 
CACGCCGATC 
GCGAGCTGGG 
TACATCCCGA 
GGAAGACGTG 
TCGAGCGCGG 
AAGCCGACGG 
GCTGGACCAA 
CGAAGCGTGA 
ATCACGCCGC 
CGAAGGCGGC 
ACTTCCGTAC 
GAAATGGTGA 



CTGCTGGCGC 50 

GTGCGACATG 100 

TTCGCGAACT 150 

GTGAAGGGTT 200 

CGAAGTGGCG 250 

CGCCGGAGCG 3 00 

TTCTCGATCT 350 

CATCGTGAAG 4 00 

TGAAGACGAC 450 

GGTCAAGCAG 50 0 

AGACGTGGAG 550 

ACACGCACTT 600 

CGTCACACGC 65 0 

GACGGACGTG 700 

TGCCGGGCGA 750 
771 
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2) INFORMATION FOR SEQ ID NO: 17 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Cedecea davisae 

(B) STRAIN: ATCC 33431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

45 GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGTGAACT TCTGTCCCAG TACGACTTCC CGGGCGACGA 20 0 

TACTCCAATC GTT CGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCAG 25 0 

50 AGTGGGAAGC TAAAATCGTT GAGCTGGCTG GCTACCTGGA TTCTTACATC 3 00 

CCTGAGCCAG AGCGTGCTAT CGATAAGCCG TTCCTGCTGC CAATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAT 450 

ACTGCGAAAT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

55 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGT AT CAAAC 55 0 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCAGG CTCTATCAAG 600 

CCACACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGC ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750 

60 GTAATGCCTG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 
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CGCGATGGAT GACGGTCTGC GTTTCGCAA 



5 2 ) INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea neterl 

(B) STRAIN; ATCC 33 855 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 



CGCTATCCTG GTTGTTGCTG CGACTGACGG CCCTATGCCT CAGAC CCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAAT GTGACATGGT TGATGACGAA GAGCTGCTGG AGCTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGCGATGACA 200 

25 CTCCAATCAT CCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAAGCAGAG 250 

TGGGAAGCTA AAATYGTTGA GCTGGCTGGC TTCCTGGATT CCTACATCCC 3 00 

AGAACCAGTA CGTGCAATCG AYCTGCCGTT CCTGCTGCCA ATCGAAGACG 350 

TATTCTCCAT CTCCGGCCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 4 00 

GGTATCGTTA AAGTGGGCGA AGAAGTAGAA ATCGTTGGTA TCAAAGATAC 450 

30 TGCGAAATCT ACCTGTACCG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTTCTGGCT AAGCCAGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT CCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCACAGTT CTACTTCCGT 700 

35 ACAACTGACG TGACCGGTAC CATCGAACTG CCAGAAGGCG TAGAGATGGT 750 

AATGCCAGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTCG 824 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 19 



{ i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH.: 827 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea. lapagel 

(B) STRAIN: ATCC 33432 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

CGCTATTCTG GTTGTTGCTG CAACTGACGG CCCTATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGC CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAAT GTGACATGGT TGATGACGAA GAGCTGCTGG AGCTGGTAGA 150 

60 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCA GGCGATGATA 200 
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CCCCAATCAT CCGTGGTTCT GCTCTGAAAG 
TGGGAAGCTA AAATCGTTGA GCTGGCTGGC 
AGAACCAGTA CGTGCAATCG ACCTGCCGTT 
TATTCTCCAT CTCCGGCCGT GGTACCGTTG 
GGTATCGTTA AAGTGGGCGA AGAAGTAGAA 
TGCGAAATCT ACCTGTACTG GCGTTGAAAT 
AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC 
GAAGAAATCG AACGTGGTCA GGTTCTGGCT 
GCACACCAAG TTCGAATCTG AAGTGTACAT 
GCCGTCATAC TCCGTTCTTC AARGGCTACC 
ACCACTGACG TGACCGGTAC CATCGAACTG 
AATGCCAGGT GACAACATCA AAATGGTTGT 
CGATGGACGA CGGTCTGCGT TTCGCAA 



CGCTGGAAGG CGAAGCAGAG 250 

TTCCTGGATT CCTACATCCC 300 

CCTGCTGCCA ATCGAAGACG 350 

TKACCGGTCG TGTAGAGCGC 400 

ATCGTTGGTA TCAAAGATAC 450 

GTTCCGCAAA CTGCTGGACG 500 

TGCTGCGTGG TATCAAACGT 55 0 

AAGCCAGGCT CTATCAAGCC 60 0 

CCTGTCCAAA GACGAAGGCG 650 

GTCCACAGTT CTACTTCCGT 700 

CCAGAAGGCG TAGAGATGGT 750 

TACCCTGATC CACCCAATCG 800 

827 



2) INFORMATION FOR SEQ ID NO: 20 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

GCGGAGCTAT CCTAGTCGTT TCAGCTACAG ACGGAGCTAT GCCACAAACT 50 

AAAGAACATA TCTTGCTAGC TCGCCAGGTT GGAGTTCCTT ATATCGTTGT 100 

TTTCTTGAAT AAAGTAGATA TGATCTCTCA AGAAGATGCT GAACTTATTG 150 

ACCTTGTTGA GATGGAACTT AGTGAGCTTC TTGAAGAAAA AGGCTACAAA 200 

GGATGCCCTA TTATCCGTGG TTCTGCTTTG AAAGCTCTTG AAGGTGATGC 250 

AAATTATATC GAAAAAGTTC GAGAACTTAT GCAAGCTGTG GATGACAACA 3 00 

TCCCTACACC AGAAAGAGAA ATTGATAAGC CTTTCTTAAT GCCTATCGAA 35 0 

GACGTATTCT CAATCTCTGG TCGTGGTACT GTGGTTACAG GAAGAATCGA 4 00 

GCGTGGAATC GTTAAAGTTT CTGATAAAGT TCAGCTCGTG GGATTAGGAG 450 

AGACTAAAGA AACAATCGTT ACTGGAGTCG AAATGTTCAG GAAAGAACTT 50 0 

CCTGAAGGTC GTGCAGGAGA AAACGTTGGT TTACTCCTCA GAGGTATTGG 550 

AAAGAACGAT GTTGAAAGAG GTATGGTGGT TTGTCAGCCT AACAGCGTGA 600 

AGCCTCATAC GAAATTTAAG TCAGCTGTTT ACGTTCTTCA GAAAGAAGAA 650 

GGCGGACGTC ATAAGCCTTT CTTCAGCGGA TACAGACCTC AGTTCTTCTT 70 0 

CCGTACTACA GACGTGACAG GAGTCGTAAC TCTTCCTGAA GGAACTGAAA 750 

TGGTAATGCC TGGAGATAAC GTTGAGCTTG ATGTTGAGCT CATTGGAACA 800 

GTTGCTCTTG AAGAAGGAAT GAGATTTGCA A 831 



2) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia psittaci 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 21 



TGGAGCGATT 
AAGAACATAT 

10 TTCCTTAACA 
CTTAGTTGAA 
GTTGCCCAAT 
AGCTACGTTG 
CCCTACTCCA 

15 ACGTATTCTC 
CGTGGAATCG 
TACTAGAGAG 
CAGAAGGTCA 
AAGAATGACG 

20 ATCTCACACA 
GTGGACGTCA 
CGTACAACAG 
GGTTATGCCA 
TAGCTCTAGA 

25 



CTCGTTGTTT 
TCTTTTGGCG 
AAATCGATAT 
ATGGAATTGT 
TATCCGTGGT 
AAAAAATTCG 
GAGCGTGAAG 
TATTTCTGGT 
TTAAAGTGGG 
ACAATTGTTA 
AGCAGGGGAA 
TTGAACGTGG 
CAATTTAAAG 
TAAAC CTTTC 
ATGTTACAGG 
GGCGATAACG 
AGAAGGTATG 



CCGCTACTGA 
AGACAGGTTG 
GATTTCTCAA 
CCGAACTTCT 
TCTGCTTTGA 
CGAGTTAATG 
TTGATAAGCC 
CGTGGTACTG 
TGATAAAGTA 
CCGGTGTGGA 
AACGTTGGTT 
TATGGTTATC 
GTGCTGTCTA 
TTTACCGGAT 
TGTTGTAACT 
TTGAATTCGA 
AGATTT 



CGGTGCGATG 
GTGTTCCTTA 
GAAGATGCTG 
AGAAGAAAAA 
AAGCCTTAGA 
CAAGCAGTGG 
TTTCTTAATG 
TGGTCACAGG 
CAGATTGTTG 
AATGTTCAGA 
TGCTCCTCAG 
TGCCAACCTA 
CATTCTACAA 
ACAGACCTCA 
CTCCCAGAAG 
AGTTCAATTA 



CCTCAGACCA 50 

CATCGTTGTT 100 

AGCTCGTAGA 150 

GGTTATAAAG 2 00 

AGGTGATGCA 250 

ATGAT AACAT 3 00 

CCTATCGAAG 350 

ACGTATCGAG 400 

GTTTAAGAGA 450 

AAAGAACTTC 500 

AGGTATCGGT 550 

ATAGCGTGAA 600 

AAAGAAGAGG 650 

GTTCTTCTTC 700 

GTACAGAGAT 750 

ATTAGCCCAG 800 
826 



2) INFORMATION FOR SEQ ID NO : 22 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 
40 (B) STRAIN: LGV 12 

(xi) SEQUENCE DESCRIPTION": " SEQ~ ID~ NO :~ 22* ~ 

GGGGCTATTC TAGTAGTTTC TGCAACAGAC GGAG CTATGC CTCAAACTAA 50 

45 AGAGCATATT CTTTTGGCAA GACAAGTTGG GGTTCCTTAC ATCGTTGTTT 100 

TTCTCAATAA AATTGACATG ATTTCCGAAG AAGACGCTGA ATTGGTCGAC 150 

TTGGTTGAGA TGGAGTTGGC TGAGCTTCTT GAAGAGAAAG GATACAAAGG 200 

GTGTCCAATC AT CAGAGGTT CTGCTCTGAA AGCTTTGGAA GGGGATGCTG 250 

CATACATAGA GAAAGTTCGA GAGCTAATGC AAGCCGTCGA TGATAATATC 300 

50 CCTACTCCAG AAAGAGAAAT TGACAAGCCT TTCTTAATGC CCATTGAGGA 350 

CGTGTTCTCT ATCTCCGGAC GAGGAACTGT AGTAACTGGA CGTATTGAGC 400 

GTGGAATTGT TAAAGTTTCC GATAAAGTTC AGTTGGTCGG TCTTAGAGAT 450 

ACTAAAGAAA CGATTGTTAC TGGGGTTGAA ATGTTCAGAA AAGAACTCCC 500 

AGAAGGTCGT GCAGGAGAGA ATGTTGGATT GCTCCTCAGA GGTATTGGTA 5 50 

55 AGAACGATGT GGAAAGAGGA ATGGTTGTTT GCTTGCCAAA CAGTGTTAAA 600 

CCTCATACAC GGTTTAAGTG TGCTGTTTAC GTTCTGCAAA AAGAAGAAGG 650 

TGGACGACAT AAGCCTTTCT TCACAGGATA TAGACCTCAA TTCTTCTTCC 700 

GTACAACAGA CGTTACAGGT GTGGTAACTC TGCCTGAGGG AGTTGAGATG 750 

GTCATGCCTG GGGATAACGT TGAGTTTGAA GTGCAGTTGA TTAGCCCTGT 800 

60 GGCTTTAGAA GAAGGTATGA GA 822 
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2 ) INFORMATION FOR SEQ ID NO : 23 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Chryseobacterium wenlngosepticum 

(B) STRAIN: CDC B7681 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 23 

20 CGGAGCTATC TTAGTATGTG CTGCTACAGA TGGTCCAATG CCTCAAACTA 50 

GAGAACACAT CCTACTTTGC CGTCAGGTAA ACGTACCTAG AATTGTTGTG 100 

TTCATGAACA AAGTTGACAT GGTAGATGAT CCAGAATTGT TAGAGCTTGT 150 

TGAG CTTGAA CTTAGAGATC TATTATCTAC TTACGAATAT GATGGTGATA 200 

ACTCTCCAGT AATTCAAGGT TCTGCTCTTG GTGCTCTTAA CGGTGATGCT 250 

25 AAGTGGGTAG CTACTGTAGA AGCTCTAATG GATGCTGTTG ATACTTGGAT 300 

CGAGCAACCA GTAAGAGATT CTGATAAGCC ATTCCTTATG CCAATCGAAG 350 

ACGTATTCTC TATTACAGGT AGAGGTACTG TAGCAACTGG TAGAATCGAG 400 

GCTGGTGTAA TCAACACAGG TGATCCTGTT GACATCGTAG GTATGGGTGA 450 

CGAGAAGTTA ACTTCTACTA TTACAGGTGT TGAGATGTTT AGAAAAATCC 500 

30 TAGACAGAGG TGAAGCTGGT GATAACGTAG GTCTATTGTT GAGAGGTATT 550 

GAAAAGACTG ACATCAAGAG AGGTATGGTT ATCGCTAAGA AAGATTCAGT 600 

TAAGCCACAC AAGAAATTCA AAGCTGAGGT TTATATCCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCCACAACA AATACCGTCC TCAGTTCTAT 700 

GTAAGAACTA CTGACGTTAC AGGTGAAATC TTCTTACCAG AAGGTGTAGA 750 

35 AATGGTAATG CCTGGTGATA ACTTAACTAT CACTGTAGAA TTGTTACAAC 800 

CAATCGCTCT TAACGAGGGT CTTAGATTCG CGATC 835 



4 0 2) INFORMATION FOR SEQ ID NO: 24 

( i ) SEQUENCE CHARACTER I ST I CS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter amalonaticus 

(B) STRAIN: ATCC 2 5405 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 2 00 

60 ACACCCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 2 50 
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GAGTGGGAAG CGAAAATCAT CGAACTGGCC 
CCCGGAACCA GAGCGTGCGA TTGACAAGCC 
ACGTATTCTC CATCTCCGGT CGTGGTACCG 
CGCGGTATCA TCAAAGTGGG CGAAGAAGTT 
5 GACTGCGAAG TCTACCTGTA CTGGCGTTGA 
ACGAAGGCCG TGCGGGTGAG AACGTTGGTG 
CGTGAAGAAA TCGAACGTGG TCAGGTACTG 
GCCGCACACC ATGTTCGAAT CYGAAGTGTA 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT 
10 CGTACAACTG ACGTGACTGG CACCATCGAA 
GGTAATGCCG GGCGACAACA TCAAAATGGT 
TCGCGATGGA CGACGG 



GGCTTCCTGG ATTCTTACAT 3 00 

GTTCCTGCTG CCGAT CGAAG 350 

TTGTTACCGG TCGTGTAGAA 400 

GAAATCGTTG GTATCAAAGA 450 

AATGTTCCGC AAACTGCTGG 50 0 

TTCTGCTGCG TGGTATCAAA 550 

GCTAAGCCGG GCWCCATCAA 600 

CATCCTGTCC AAAGACGAAG 650 

ACCGTCCGCA GTTCTACTTC 70 0 

CTGCCGGAAG GCGTTGAGAT 750 

TGTTACCCTG ATCCACCCGA 800 

816 
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2) INFORMATION FOR SEQ ID NO : 25 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 825 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter braakii 

(B) STRAIN: ATCC 43162 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 



CGCGATCCTG GTTGTTGCTG CAACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGY CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTAGA 150 

35 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGATTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAWGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATCCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCT ATCGAAGACG 350 

TATTCTCCAT CTCTGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 40 0 

4 0 GGTATCATCA AAGTTGGTGA AGAAGTTGAA ATCGTTGGTA TCAARGACAC 450 

TGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTT C TGCTGCGTGG TATCAAg'cGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AARGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGACTGGTAC CATCGAACTG CCGGAAGGCG TTGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTCGC 825 



2) INFORMATION FOR SEQ ID NO : 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

17 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter koserx 

(B) STRAIN: ATCC 27156 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAGATGGAA GTGCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAMGCT 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTACCTGG ATTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 40 0 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATYGTTG GTATCAAAGA 450 

GACTGCGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GYTCCATCAA 600 

20 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATYCTGTCY AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 830 

2 5 



2) INFORMATION FOR SEQ ID NO: 27 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter farmer i 
40 (B) STRAIN: ATCC 51112 

(XI) SEQlffii^E^ ~ SEQ~ ID~ NOT 27~ ~~ 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 50 

45 AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTAGA 150 

GATGGAAGTT CGTGAACTGC TGTCTCAGTA CGATTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCAGGC TTCCTGGATT CTTACATCCC 300 

50 GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCTGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 400 

GGTATCATCA AAGTGGGTGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 450 

TGCCAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 50 0 

AAGGCCGTGC TGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

55 GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCW CCATCAAGCC 600 

RCACACTATG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACGACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGTG TTGAGATGGT 750 

TATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCGATCG 800 

60 CGATGGACGA CGGTCTGCGT TTCGCAA 827 
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2) INFORMATION FOR SEQ ID NO: 28 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY; Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Ci trobacter freundll 

(B) STRAIN: ATCC 8090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

20 CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 10 0 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGATTT CCCGGGCGAC GACACTCCGA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGAAGC AGAGTGGGAA 250 

25 GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCAGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCTATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 40 0 

ATCAAAGTTG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTGCTAA 450 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

3 0 GTGCTGGTGA GAACGTTGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCTCTATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGACGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 700 

GACGTGACTG GTACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 750 

35 GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCA ATCGCGA 797 



40 



2 ) INFORMATION FOR SEQ ID NO: 29 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Ci trobacter sedlakli 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

55 CGGCGCGATC CTGGTTGTTG CCGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAGATGGAA GTTCG TGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACGCCGAT CGTTCGTGGT TCAGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

60 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 
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TCCGGAACCA GAGCGTGCGA TTGACAAGCC 
ACGTATTCTC CATCTCCGGT CGTGGTACCG 
CGCGGTATCA TCAAAGTGGG CGAAGAAGTT 
GACTGCGAAG TCTACCTGTA CTGGCGTTGA 
5 ACGAAGGCCG TGCGGGTGAG AACGTAGGTG 
CGTGAAGAAA TCGAACGTGG TCAGGTACTG 
GCCGCACACC AAGTTCGAAT CTGAAGTGTA 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT 
CGTACAACTG ACGTGACTGG CACCATCGAA 
10 GGTAATGCCG GGCGACAACA TCAAAATGGT 
TCGCGATGGA CGACGGTCTG CGTTTC 



GTTCCTGGTG CCGATCGAAG 3 50 

TTGTTACCGG TCGTGTAGAG 4 00 

GAAATCGTTG GTATCAAAGA 4 50 

AATGTTCCGC AAACTGCTGG 5 00 

TTCTGCTGCG TGGTATCAAA 550 

GCGAAGCCGG GCACCATCAA 6 00 

TATTCTGTCC AAAGATGAAG 65 0 

ACCGTCCGCA GTTCTACTTC 700 

CTGC CGGAAG GCGTAGAGAT 750 

TGTTACCCTG ATCCACCCGA 800 

826 



15 2) INFORMATION FOR SEQ ID NO : 30 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE; Genomic DNA 



25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trohacter werkmanii 

(B) STRAIN: ATCC 51114 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

30 

GCGATCCTGG TTGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 150 

ATGGAAGTTC GTGAACTTCT GTCTCAGTAC GATTTCCCGG GCGACGACAC 20 0 

35 TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GAAGCAGAGT 250 

GGGAAGCGAA AATCATCGAA CTGGCTGGCT TTCTGGATTC TTACATCCCG 30 0 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTRCCTA TCGAAGACGT 350 

ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAGCGCG 40 0 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGACACC 450 

40 GCTAAGTCTA CCTGTACCGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 50 0 

AGGCCGTGCT GGTGAGAACG TTGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCTC TATCAAGCCG 60 0 

CACACCAAGT TCGAATCTGA AGTGTACATC CTGTCCAAAG ACGAAGGCGG 650 

C CGT CAT ACT CCGTTCTTCA AAGGCTACCG T CCGCAGTTC TACTTCCGTA 700 

45 CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA 750 

ATGCCGGGCG ACAACATYAA AATGGTTGTT ACYCTGAT CC ACCCGATCGC 8 00 

GATGGACGAC GGTCTGCGTT TCG 823 



50 

2 ) INFORMATION FOR SEQ ID NO : 31 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 826 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

20 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter youngae 

(B) STRAIN: ATCC 2 9935 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCAT CGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

10 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CGGGCGACGA 2 00 

TACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCAG 2 50 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCGGAACCAG AACGTGCTAT CGATAAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACTGGT CGTGTAGAAC 400 

15 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCCAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 5 00 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

20 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACGGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 7 50 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

CGCGATGGAT GACGGTCTGC GTTTCG 826 



25 



40 



2) INFORMATION FOR SEQ ID NO : 32 



( i ) SEQUENCE CHARACTERISTICS : 
3 0 (A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



CGGAGCTATA TTAGTTTGTT CAGCAGCTGA TGGTCCAATG CCTCAAACAA 50 

GAGAGCACAT CTTATTATCA TCAAGAGTTG GAGTTGACCA CATCGTAGTA 100 

45 TTCTTAAACA AAGCAGATAT GGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTAGAGAGT TATTAAGCGA GTACAACTTC CCAGGAGACG 200 

AYATTC CAGT AAT CAARGG A TCAGCTTTAG TAGCATTAGA AAACCCAACT 250 

GACGAAGCTG CAACAGCTTG TATCAGAGAG TTAATGGATG CTGTAGATAG 30 0 

CTACATCCCA ACACCAGAAA GAGCAACAGA TAAGCCATTC TTAATGC CAG 350 

50 TAGAGGACGT ATTCACAATC ACTGGTAGAG GAACAGTTGC AACAGGAAGA 400 

GTTGAAAGAG GAGTTCTACA TGTAGGAGAC GAAGTAGAAG TAATCGGATT 450 

AACTGAAGAA AGAAGAAAAA CTGTTGTAAC AGGAATCGAA ATGTTCAGAA 500 

AGTTATTAGA TGAAGCACAA GCTGGAGATA ACATCGGAGC ATTATTAAGA 550 

GGTATCCAAA GAACTGAYAT CGAAAGAGGT CAAGTTTTAG CTCAAGTTGG 600 

55 AACAATCAAC CCACACAAAA AATTCGTAGG TCAAGTATAC GTACTTAAAA 650 

AAGAAGAAGG TGGAAGACAT ACTCCATTCT TCGATGGATA CAGACCACAA 700 

TTCTACTTCA GAACAACAGA CGTTACAGGA TCAATCAAAT TACCAGAAGG 750 

AATGGAAATG GTTATGCCTG GAGACCACAT CGACATGGAA GTTGAATTAA 8 00 

TCACAGAAAT CGCTATGGAY GAAGGATTAA GATTCGCTAT C 841 



60 
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2) INFORMATION FOR SEQ ID NO: 33 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 
15 (B) STRAIN: ATCC 15668 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

CGGCGCCATC CTGGTGTGCT CGGCCGCTGA CGGCCCCATG CCCCAGACCC 50 

2 0 GCGAGCACAT CCTGCTGGCC CGTCAGGTGG GCGTGCCCTA CATCATCGTG 10 0 

TTCCTGAACA AGTGCGACAT GGTGGACGAC GAAGAGCTGC TGGAACTGGT 150 

CGAAATGGAA GTGCGCGAGC TGCTTGCCAA GTACGACTTC CCCGGCGACG 200 

ACACCCCCAT CATCCGCGGC TCGGCCAAGC TGGCCCTGGA AGGCGACCAG 250 

TCCGACAAGG GCGAACCTGC CATCCTGCGC CTGGCTGAAG CACTGGACTC 300 

25 CTACATCCCC ACGCCCGAGC GCGCTGTGGA CGGCGCCTTT GCAATGCCCG 350 

TGGAAGACGT GTTCTCGATC TCTGGCCGTG GCACCGTGGT GACTGGCCGT 400 
ATCGAGCGCG GCATCATCAA GGTCGGCGAA GAAATCGAAA TCGTCGGTAT ' 450 

CCGCGACACC CAGAAG AC C A TCGTCACCGG CGTGGAAATG TTCCGCAAGC 50 0 

TGCTGGACCA AGGTCAAGCT GGCGACAACG TGGGTCTGCT GCTGCGCGGC 550 

30 ACCAAGCGTG AAGACGTGGA ACGCGGCCAA GTGCTGTGCA AGCCCGGCTC 600 

CATCAAGCCC CACACCCACT TCACGGCTGA GGTGTACGTG CTGTCCAAGG 650 

ACGAAGGTGG TCGCCACACT CCGTTCTTCA ACAACTACCG TCCCCAGTTC 700 

TATTTCCGTA CGACCGACGT GACCGGCTCC ATCGAGCTGC CCGCCGACAA 750 

GGAAATGGTG ATGCCTGGCG ACAACGTGTC GATCACCGTC AAGCTGATCG 800 

3 5 CCCCCATCGC CATGGAAGAA GG 822 



2) INFORMATION FOR SEQ ID NO: 34 

40 

{ i) SEQUENCE CHARACTERISTICS : 
IA)~ 4 LENGTH?" 70~2~ £>ases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE; Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Coryn eba cterium bovls 

(B) STRAIN: ATCC 7715 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

55 GCCGCAGACC CGTGAGCACG TCCTCCTGGC CCGTCAGGTC GGTGTGCCCT 50 

ACATCCTCGT CGCCCTCAAC AAGTGCGACA TGGTCGACGA CGAGGACCTC 10 0 

ATCGAGCTCG TCGAGATGGA GGTCCGTGAG CTCCTCGCCG AGCAGGACTA 150 

CGACGAGGAC GCCCCGATCA TCCACATCTC CGCCCTCAAG GCCCTCGAGG 200 

GTGACCCGGA GTGGACGCAG CGCATCGTCG ACCTCATGAA GGCCTGCGAC 250 

60 GACGCCATCC CGGATCCGGA GCGCGAGACG GACAAGCCGT TCCTCATGCC 3 00 

22 
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GATCGAGGAC ATCTTCACGA TCACCGGCCG 
GTGTCGAGCG TGGCATCCTC AACGTCAACG 
AT CTGCGAGA ACTCCCAGAA GACGACCGTC 
CAAGTTCCTC GACACGGCCG AGGCCGGCGA 
5 GTGGCCTGAA GCGCGAGGAC GTCGAGCGTG 
GGCGCCTACA CGCCGCACAC CGAGTTCGAG 
CAAGGACGAG GGTGGCCGCC ACACGCCGTT 
AGTTCTACTT CCGGACGACC GACGTCACCG 
GG 



CGGCACCGTC GTCACGGGCC 350 

AGGAGGTCGA GATCCTGGGT 400 

ACCTCCATCG AGATGTTCAA 450 

CAACGCCGCC CTGCTGCTCC 500 

GCCAGATCGT GGCCAAGCCG 550 

GGCTCCGTGT ACATCCTCTC 600 

CTTCGACAAC TACCGTCCGC 650 

GCGTCGTCAA GCTGCCGGAG 700 

702 



2) INFORMATION FOR SEQ ID NO: 35 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynehacterium cervicis 
25 (B) STRAIN: NCTC 10604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

GGCTCAGACC CGCGAGCACG TTCTGCTTGC TCGCCAGGTT GGCGTTCCGA 5 0 

30 CGATCCTGGT TGCCCTCAAC AAGGCCGATA TGGTCGACGA TGAGGAAATG 10 0 

CTGGAGCTCG TTGAGGAAGA GTGCCGCGAC CTGCTCGAGT CCCAGGACTT 150 

CGATCGTGAC GCCCCGATCA TCCAGGTTTC CGCGCTGAAG GCTCTCGAAG 20 0 

GTGATCCGCA GTGGGTTGCT AAGGTCGAGG AGCTCATGGA GGCAGTCGAC 25 0 

ACCTTCGTGC CGACTCCTGA GCGCGACATG GACAAGCCGT TCCTCATGCC 3 00 

35 GATCGAAGAC GTCTTCACCA TCACCGGCCG TGGCACCGTT GTTACCGGTC 350 

GTGTTGAGCG TGGCAAGCTC CCGATCAACT CTGAGGTTGA AATCCTCGGT 4 00 

ATCCGCGAAC CGCAGAAGAC CACCGTTACC GGTATCGAGA TGTTCCACAA 4 50 

GTCCATGGAT GAAGCATGGG CAGGCGAGAA CTGTGGTCTC CTCCTGCGTG 5 00 

GCACCAAGCG CGATGAGGTT GAGCGCGGTC AGGTCGTTGC CGTTCCCGGT 550 

40 TCGATCACCC CGCACACCAA CTTCACCGGA CAGGTCTACA TCCTCAAGAA 600 

GGAAGAAGGC GGTCGTCACA ACCCGTTCTT CTCGAACTAC CGTCCGCAGT 650 

TCTACTTCCG CACCACGGAC GTGACCGGCG" TCATCACCC"" 68 9"" 



45 



55 



2) INFORMATION FOR SEQ ID NO: 3 6 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 804 bases 
50 (B) TYPE.: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium flav&scens 

(B) STRAIN: ATCC 10340 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 6 

23 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GGTTGTTGCT GCAACCGATG GTCCTATGCC GCAGACCCGC GAG CACGTTC 50 

TTCTGGCTCG CCAGGTTGGC ■ GTTCCTTACA TCCTCGTTGC TCTTAACAAG 100 

TGCGACATGG TTGATGATGA GGAAATCATC GAGCTCGTTG AGATGGAAAT 150 

5 CCGCGAACTG CTCGCTGAGC AGGACTACGA CGAGGATGCC CCCATCATCC 2 00 

ACATCTCCGC TCTCAAGGCT CTTGAGGGTG ACGAGAAGTG GGTACAGGCC 250 

ATCGTCGACC TCATGCAGGC CTGCGATGAC TCCATTCCGG ATCCGGAGCG 3 00 

CGAGACCGAC AAGCCCTTCC TCATGCCTAT CGAGGACATC TTCACCATCA 3 50 

CCGGCCGCGG TACCGTTGTT ACCGGCCGTG TTGAGCGTGG CGTTTTGAAG 400 

10 GTCAACGAGG ATGTTGAGAT CATCGGCATC AAGGAGAAGT CCATCTCCAC 450 

CACCGTTACC GGTATCGAAA TGTTCCGCAA GATGATGGAC TACACCGAGG 500 

CTGGCGACAA CTGTGGTCTG CTTCTGCGTG GTACCAAGCG TGAAGAGGTC 550 

GAGCGCGGCC AGGTTGTTAT CAAGCCGGGC GCCTACACCC CCCACACCAA 600 

GTTCGAGGGT TCCGTCTACG TCCTCAAGAA GGAAGAGGGC GGCCGCCACA 650 

15 CCCCGTTCAT GGACAACTAC CGTCCGCAGT TCTACTTCCG TACCACTGAC 700 

GTGACCGGCG TTGTTCACCT GCCTGAGGGC ACCGAGATGG TCATGCCTGG 750 

CGACAACGTT GATATGACCG TTGAGCTCAT CCAGCCCGTC GCTAGGATGA 800 

GGGC 804 

20 

2 ) INFORMATION FOR SEQ ID NO.: 3 7 

(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium kutscherl 

(B) STRAIN: ATCC 15677 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 7 

TGCCTCAGAC CCGTGAGCAC GTTCTTCTTG CTCGCCAGGT TGGCGTTCCT 50 

TACATCCTCG TTGCTCTTAA CAAGTGCGAC ATGGTTGACG ATGAGGAAAT 100 

40 CAT CGAGCTC GTTGAGATGG AAGTTCGCGA GCTTCTTGCT GAGCAGGAGT 150 

ACGATGAAGA GGCTCCAATC ATCCACATCT CTGCTTTGAA GGCTCTTGAG 200 

GGCGACGAGA AGTGGAC"tCA" GGCC^TCATc" GA'cCT^TGc"' AG~G CTTGTGA " 2 5 0~ " 

TGACTCCATC CCAGATCCAG AGCGTGAGAC CGACAAGCCA TTCCTCATGC 3 00 

CTATCGAGGA TATCTTCACC AT CAC CGGTC GTGGCACCGT TGTTACCGGT 350 

45 CGTGTTGAGC GCGGTTCCTT GAAGGTGAAT GAGGACGTCG AGATCATCGG 4 00 

CATCAAGGAG AAGTCCACCA CTACTACCGT TACCGGTATC GAAATGTTCC 450 

GTAAGCTTCT TGATTACACC GAAGCTGGCG ATAACTGTGG TCTGCTTCTT 500 

CGTGGTATCA AGCGCGAAGA CGTTGAGCGT GGTCAGGTTG TTGTTAAGCC 550 

AGGCGCTTAC ACACCTCACA CCGAGTTCGA GGGCTCTGTT TACGTTCTTT 600 

50 CCAAGGACGA GGGCGGCCGC CACACCCCAT TCTTCGACAA CTACCGTCCA 650 

CAGTTCTACT TCCGCACCAC TGACGTTACC GGTGTTGTGA AG 6 92 



55 2) INFORMATION FOR SEQ ID NO: 38 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

24 
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(D> TOPOLOGY: Linear 
( i i ) MOLECULE TYPE : Genomi c DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium minutissimum 

(B) STRAIN: ATCC 23348 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 



CCTGGTTGTT GCTGCAACCG ATGGCCCGAT GCCGCAGACC CGCGAGCACG SO 

TTCTTCTGGC CCGCCAGGTT GGCGTTCCGT ACATCCTCGT TGCACTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAAATC ATCGAGCTCG TTGAGATGGA 150 

GATCCGTGAG CTGCTCGCTG AGCAGGACTA CGACGAGGAA GCTCCGATCG 200 

15 TTCACATCTC CGCTCTGAAG GCTCTTGAGG GCGACGAGAA GTGGGCACAG 250 

TCCATCGTTG ACCTGATGCA GGCTTGCGAT GACTCCATCC CGGATCCGGA 3 00 

GCGCGAGCTG GACAAGC CGT TCCTGATGCC GATCGAGGAC ATCTTCACCA 3 50 

TTACCGGCCG CGGTACCGTT GTTACCGGCC GTGTTGAGCG TGGCTCCCTG 4 00 

AACGTTAACG AGGACATCGA GATCATCGGT ATCAAGGACA AGTCCATGTC 4 50 

20 CACCACCGTT ACCGGTATCG AGATGTTCCG CAAGATGATG GACTACACCG 500 

AGGCTGGCGA CAACTGTGGT CTGCTTCTGC GTGGTACCAA GCGTGAAGAG 550 

GTTGAGCGTG GCCAGGTTTG CATCAAGCCG GGCGCTTACA CCCCGCACAC 6 00 

CAAGTTCGAG GGTTCCGTCT ACGTCCTGAA GAAGGAAGAG GGCGGCCGCC 650 

ACACCCCGTT CATGGACAAC TACCGTCCGC AGTT CTACTT CCGCACCACC 700 

25 GACGTCACCG GTGTCATCAA GCTGCCGGAG GGCACCGAGA TGGTCATGCC 750 

GGGCGACAAC GTTGAGATGT CCGTAGAGCT GATCCAGCCG GTCGCTA 797 



3 0 2 ) INFORMATION FOR SEQ ID NO : 3 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 bases 

(B) TYPE: Nucleic acid 
35 <C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium mycetoides 
(B J STRAIN : ATCC 21134 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 9 



GCCGCAGACC CGCGAGCACG TTCTTCTGGC CCGCCAGGTC GGCGTCCCCT 50 

ACATCCTCGT TGCGCTGAAC AAGTGCGACA TGGTTGATGA TGAGGAGATC 100 

ATCGAGCTCG TGGAGATGGA GGTC CGTGAG CTGCTCGGCG AGCAGGACTA 150 

CGACGAGGAC GCCCCCATCA TCCACATCTC CGCTCTGAAG GCTCTCGAGG 200 

50 GCGACGAGAA GTGGGTTCAG TCCGTGCTCG ACCTCATGCA GGCGTGCGAC 250 

GACTCCATCC CGGATCCGGT CCGCGAGACC GACCGCGACT TCCTGATGCC 3 00 

GATCGAGGAC ATCTTCACCA TCTCCGGCCG CGGCACCGTG GTTACCGGTC 350 

GTGTGGAGCG CGGCGTGCTC AACCTCAACG ACGAGGTCGA GATCATCGGC 400 

ATCCGCGACA AGTCCCAGAA GACCACCGTC ACCTCCATCG AG ATGTT CAA 4 50 

55 CAAGCTGCTC GATACCGCTG AGGCAGGCGA CAACGCGGCT CTGCTGCTCC 5 00 

GCGGTCTGAA GCGCGAGGAC GTCGAGCGTG GCCAGGTTGT CATCAAGCCG 550 

GGCGCCTACA CCCCGCACAC CAAGTTCGAG GGTTCCGTCT ACGTCCTGTC 600 

CAAGGACGAG GGCGGCCGCC ACACCCCGTT CTTCGACAAC TACCGTCCGC 650 

AGTT CTACTT CCGCACCACC GACGTGACCG GTGTTGTGAA GCTGCCGGAG 700 

60 GG 702 

25 
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2 ) INFORMATION FOR SEQ ID NO: 40 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 674 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Corynebacterium pseudogenitalium 

(B) STRAIN: ATCC 3 3038 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

20 GCTCGCCAGG TTGGCGTTCC TTACATC CTC GTTGCGCTGA ACAAGTGCGA 50 

CATGGTTGAT GATGAGGAAA TCATCGAGCT CGTTGAGATG GAGATCCGTG 10 0 

AGCTGCTCGC AGAGCAGGAT TACGATGAGG AAGCTCCTAT CGTTCACATC 150 

TCCGCTCTGA AGGCCCTCGA GGGCGATGAC AAGTGGGTAC AGTCCGTCGT 200 

TGATCTGATG GAAGCCTGCG ACAACTCCAT CCCGGATCCG GAGCGCGCTA 250 

25 CCGACCAGCC GTTCCTGATG CCTATCGAGG ACATCTTCAC CATTACCGGC 30 0 

CGCGGTACCG TTGTTACCGG CCGTGTTGAG CGTGGCCGTC TGAACGTCAA 350 

CGAGGACGTT GAGATCATCG GTATCCAGGA GAAGTCCCAG ACCACCACCG 400 

TTACCGGTAT CGAGATGTTC CGCAAGATGA TGGACTACAC CGAGGCTGGC 450 

GACAACTGTG GTCTGCTTCT GCGTGGTACC AAGCGTGAGG ACGTTGAGCG 500 

3 0 TGGCCAGGTT GTTATCAAGC CGGGCGCTTA CACCCCGCAC ACCAAGTTCG 550 

AGGGCTCCGT CTACGTCCTG AAGAAGGAAG AGGGCGGCCG CCACACCCCG 600 

TTCATGAACA ACTACCGTCC GCAGTTCTAC TTCCGTACCA CGGACGTTAC 650 

CGGTGTTGTT CACCTGCCAG AGGG 674 



35 



50 



2) INFORMATION FOR SEQ ID NO: 41 



( i ) SEQUENCE CHARACTERISTICS : 
4 0 (A) LENGTH: 694 bases 

(B) TYPE :__ Nucleic acid 

( C) " STRANDEDNE SS": " Double" 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium renale 

(B) STRAIN: ATCC 19412 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



TGCCTCAGAC CCGTGAGCAC GTT CTGCTTG CTCGTCAGGT CGGCGTTCCT 50 

TACATCCTCG TTGCACTGAA CAAGTGCGAC ATGGTCGACG ACGAAGAAAT 100 

55 CATCGAGCTC GTCGAGATGG AAATCCGTGA ACTGCTCGCA GAGCAGGACT 150 

ACGATGAGGA AGCTCCTATC GTTCACATCT CCGCTCTGGG CGCCCTGAAC 200 

GGCGAGCAGA AGTGGGTTGA CTCCATCGTC GAACTGATGG AAGCTTGCGA 250 

CAACTCCATC CCAGACCCAG TTCGCGACAT CGACCACCCA TTCCTGATGC 300 

CTATCGAGGA CATCTTCACC ATTACCGGTC GCGGTACCGT TGTTACCGGC 350 

6 0 CGTGTCGAGC GTGGCCGTCT CAACGTCAAC GAAGAAGTTG AGATCATCGG 400 

26 
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TATCAAGGAC AAGTCCCAGA AGACCACCGT 
GCAAGATGCT GGACTACACC GAAGCTGGCG 
CGCGGCATCG GCCGTGAGGA TGTCGAGCGT 
AGGCGCTTAC ACCCCTCACT CTGAGTTCGA 
CCAAGGACGA GGGTGGCCGC CACACCCCAT 
CAGTTCTACT TCCGCACCAC CGACGTGACC 



CACCGGTATC GAGATGTTCC 450 

ACAACTGTGG TCTGCTGCTC 500 

GGCCAGGTTA TCATCAAGCC 550 

GGGCTCTGTC TACGTCCTGT 600 

TCTTCGACAA CTACCGTCCA 650 

GGCGTTGTGC ACCT 694 



10 2) INFORMATION FOR SEQ ID NO: 42 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 687 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterlum ulcerans 

(B) STRAIN: NCTC 8665 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 



GCCGCAGACC CGCGAGCACG TTCTGCTGGC TCGCCAGGTT GGCGTTCCKT 5 0 

ACATCCTSGT TGCACTGAAC AAGTGCGACA TGGTTGACGA TGAGGARCTC 100 

CTSGAGCTCG TCGAGATGGA GGTCCGCGAG CTGCTGGCTG AGCAGGACTA 15 0 

CGACGAGGAA GCTCCGRTCG TTCACATCTC CGCWCTGAAC GCCCTGGACG 200 

30 GCGACSAGAA GTGGGCTVAC TCCATCCTCG AGCTGATGCA GGCTTGCGAC 250 

GAGTCCATCC CGGATCCGGA GCGCGAGACC GACAAGCCGT TCCTGATGCC 300 

GATTGAGGAC ATCTTCACCA TTACCGGTCG CGGYACCGTT GTTACCGGCC 3 50 

GTGTTGAGCG TGGCDTCCTG AACGTSAACG ACGASGTTGA GATCATGGGY 400 

ATCCGGGAGA AGTCCCAGAA GACCACCGTY ACCKSCATCG AGATGTTCAA 45 0 

35 CAAGMTGMTG GACWCCGCAG AGGCTGGCGA CAACGCTGSW CTGCTGCTGC 50 0 

GTGGTMTSAA GCGTGAGGAC GTTGAGCGTG GCCAGATCAT CGYTAAGCCG 550 

GGCGCKTACA CCCCGCACAC CGAGTTCGAG GGCTCCGTCT ACGTCCTGTC 60 0 

CAAGGACGAG GGCGGCCGCC ACACCCCGTT CTTCGACAAC TACCGTCCGC 65 0 

AGTTCTACTT CCGCACCACC GACGTS AC CG GTGTTGT 687 

40 



2) INFORMATION FOR SEQ ID NO : 43 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coryneba.ct ex*± um urealyticum 
55 (B) STRAIN: ATCC 43042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

CTGGTTGTTG CTGCAACCGA TGGCCCGATG CCGCAGACCC GTGAGCACGT 50 
60 TCTGCTGGCT CGCCAGGTTG GCGTTCCGTA CATCCTCGTT GCACTGAACA 100 

27 
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AGTGCGACAT GGTTGACGAT GAGGAGCTCC TCGAGCTCGT CGAGATGGAG 150 

GTCCGCGAGC TTCTGGCTGA GCAGGACTAC GACGAGGAGG CTCCGGTCGT 2 00 

CCCGATCTCC GCACTGGGCG CCCTGGACGG CGATCAGAAG TGGGTCGACT 250 

CCATCCTCGA GCTCATGAAG GCTTGCGACG AGTCCATCCC GGACCCGGAG 3 00 

5 CGCGAGACCG ACAAGCCGTT CCTGATGCCG GTTGAGGACA T CTTC AC CAT 3 50 

TACCGGTCGC GGCACCGTCG TTACCGGCCG TGTTGAGCGT GGCGT CCTGA 4 00 

ACCTGAACGA CGAGGTCGAG ATCCTGGGCA TCCGCGAGAA GTCCACCAAG 450 

ACCACCGTCA CCTCCATCGA GATGTTCAAC AAGCTG CTGG ACACCGCAGA 500 

GGCTGGCGAC AACGCTGCAC TGCTGCTGCG TGGTCTGAAG CGTGAGGACG 550 

10 TCGAGCGAGG CCAGATCATC GCTAAGCCGG GCGCTTACAC CCCGCACACC 6 00 

GAGTTCGAGG GCTCCGTCTA CGTCCTGTCC AAGGACGAGG GCGGCCGTCA 650 

CACCCCGTTC TTCGACAACT ACCGTCCGCA GTTCTACTTC CGTACCACCG 700 

ACGTCACCGG TGTCGTTACC CTGCCAGAGG GCACCGACAT GGT CATGCCG 750 

GGCGACAACG TTGAGATGAG CGTCAAGC 778 

15 



2) INFORMATION FOR SEQ ID NO: 44 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 703 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium xerosis 
3 0 (B) STRAIN: ATCC 373 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

CGCAGACCCG TGAGCACGTC CTCCTGGCCC GCCAGGTCGG CGTCCCCTAC 50 

3 5 ATCCTCGTCG CCCTGAACAA GTGCGACATG GTCGACGATG AGGAGATCAT 10 0 

CGAGCTCGTG GAGATGGAGG TGCGTGAGCT TCTCGCCGAG CAGGACTACG 150 

ACGAGGAGGC CCCGATCGTG CACATCTCCG CCCTGGGCGC CCTCAATGGC 200 

GAAGAGAAGT GGGTCGACTC CATCGTCGAG CTCATGAACG CCGTCGACGA 250 

GAACGTTCCG GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCCG 3 00 

40 TCGAGGACAT CTTCACCATC ACCGGCCGCG GCACCGTCGC CACCGGTCGC 350 

GTGGAGCGCG GCACCCTGAA GGTCAACGAC GAGGTCGAGA TCCTGGGCAT 400 

CCAGGAGAAG TCCCAGACCA CCACCGTCAC CGGCATCGAG ATGTTCCGCA 450 

AGCTGCTGGA CTCCGCCGAG GCCGGCGACA ACTGTGGCCT GCTGCTCCGC 500 

GGCATCAAGC GCGAGGACAT CGAGCGCGGC CAGATCATCG CGAAGCCGGG 55 0 

45 CGCCTACACC CCGCACACCG AGTTCGAGGG CTCCGTCTAC ATCCTGGCCA 600 

AGGACGAGGG CGGCCGCCAC ACCCCGTTCT TCGACAACTA CCGTCCGCAG 650 

TTCTACTTCC GCACCACCGA CGTCACCGGC GTCGTGAAGC TGCCGGAGGG 70 0 
CAC 



50 



2) INFORMATION FOR SEQ ID NO: 45 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

28 



703 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 



GGAGCGATAT 
GGAACACATT 

10 ACTTGAACAA 
GAAATGGAAG 
GACGCCGATA 
GTGAGGTTGG 
TACTTCCCGC 

15 CGAAGATGTG 
TAGAGCGAGG 
AAGGACACGA 
ATTGGATGAA 
CGAAACGCGA 

2 0 ATCACGCCAC 
AGAAGGGGGA 
ATTTCCGCAC 
ATAGAGATGG 
TGCGCCGGTA 

25 



TGGTGGTGAG 
GTATTGGCGA 
AGCGGACATG 
TGAGGGATTT 
ATAGTGGGGT 
GGAGCCATCG 
AGCCGGAGCG 
TTTTCGATAT 
GATCATCAAA 
CGAAGACGAC 
GGTCAAGCGG 
AGAAGTGGAG 
ACAAGAAATT 
CGCCACACAC 
GACGGACGTG 
TGATGCCGGG 
GCGATGGATG 



CGCAGCGGAC 
AGCAAGTGGG 
GTGGATGACA 
ATTGAACAGT 
GAG CGTT AAA 
ATAATCAAAT 
AGCGATAGAC 
CGGGCCGAGG 
GTGGGCGACG 
GTGCACGGGC 
GTGACAACGT 
CGTGGTCAAG 
TGAGGCGGAG 
CGTTTTTACA 
ACGGGCCAGT 
AGATAACGTG 
AAGGGCTACG 



GGCCCGATGC 
TGTTCCGAAC 
AAGAGCTGTT 
TATGATTTCC 
GGCGTTAGAA 
TAGTGGAAAC 
AAACCGTTTT 
GACGGTGGTG 
AGATAGAGAT 
GTTGAGATGT 
AGGAATTTTA 
TATTGGCGAA 
ATTTATGTGT 
AGGCTATCGA 
TATTGAGTTT 
AAAGTGACGG 
AT 



CGCAAACGCG 50 

ATAGTGGTTT 100 

GGAATTAGTG 150 

CTGGGGATGA 2 00 

GGTGACAAGA 2 50 

GATGGACACG 300 

TAATGCCGAT 350 

ACGGGACGCG 400 

TGTGGGGATC 450 

TTCGCAAATT 500 

TTGAGAGGGA 550 

ACCGGGATCG 600 

TGTCGAAGGA 650 

CCGCAATTTT 70 0 

ACCGGAGGGG 750 

TTGAATTGAT 8 0 0 
832 



2) INFORMATION FOR SEQ ID NO : 4 6 

3 0 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshinae 
40 (B) STRAIN: ATCC 33379 

(Xi) SEQUENCE iDES^Tp^ION : ~ SEQ~ ~ID~~n61~ 46 " 

GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 5 0 

45 TGAGCACATC CTGCTGGGTC GCCAGGTAGG CGTTCCGTAC ATCATCGTGT 10 0 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGCGAACT GCTGTCTCAG TACGATTTCC CGGGCGACGA 200 

TACGCCGGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC GAAGATCATC GAACTGGCTG AAACGCTGGA CTCCTACATT 3 00 

50 CCGGAACCTG AGCGTGACAT CGACAAGCCG TTCCTGCTGC CGATCGAAGA 35 0 

CGTATTCTCA ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTAGGC GACGAAGTTG AAATCGTAGG TAT CAAGCCG 450 

ACCACCAAGA CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTACCAAGC 550 

55 GTGACGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCACT 600 

CCGCACACCA AGTTCGAATC AGAAGTGTAC ATCCTGAGCA AGGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGTTA CCGTCCGCAG TTCTACTTCC 7 00 

GTACCACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATG CCGG GCGACAACAT CAAGATGGTT GTTACCCTGA TCCACCCGAT 8 00 

60 CGCCATGGAC GATGGT 816 
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2) INFORMATION FOR SEQ ID NO: 47 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Bdwardsiella tarda 

(B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

20 GGCG CGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAG CACATC CTGTTGGGTC GCCAGGTAGG CGTTCCGTAC ATCAT CGTGT 100 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGCGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACGCCGGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

25 AGTGGGAAGC GAAGATCATC GAACTGGCTG AAACTCTGGA CTCCTACATC 300 

CCGGAACCTG AGCGTGACAT CGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCT ATCTCTGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 40 0 

GCGGTATCAT CAAGGTAGGC GACGAAGTTG AAATCGTTGG TATCAAGCCG 450 

ACCACCAAGA CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

3 0 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAGC 550 

GTGACGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCACT 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGAGCA AGGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

35 GTAATGCCGG GCGACAACAT CAAGATGGTT GTTACCCTGA TCCACCCGAT 80 0 

CGCCATGGAC GATGGTCTGC G 821 



40 2) INFORMATION FOR SEQ ID NO : 48 

( i ) SEQUENCE CHARACTERIST I CS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23834 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 

CGGTGCCATC CTGGTGGTAT CCGCTGCTGA CGGCCCCATG CCTCAGACTC 50 

GCGAACACAT CCTGTTGGCT CGT CAGGTAG GTGTACCCTA CATCCTCGTA 100 

TTCATGAACA AATGCGACAT GGTAGATGAT GCCGAGCTGC TTGAGTTGGT 150 

TGAGATGGAA ATCCGCGACC TGCTCTCCAG CTATGACTTC CCTGGTGACG 200 

ACTGCCCGAT CGTACAAGGT TCCGCTCTCA AAGCCCTCGA AGGCGATGCC 250 

30 
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GGTTACAAAG AAAAAATCTT 
CCCCACTCCT CAACGTGCTG 
ACGTATTCTC TATCTCCGGC 
CGCGGCATCA TCAAAGTAGG 
5 CACTCAGAAA ACTACCTGTA 
ACGAAGGTCA GGCCGGTGAC 
CGTGAAGAAG TTGAGCGTGG 
TCCGCACACC AAGTTCAAAG 
GTGGTCGTCA CACCCCGTTC 
10 CGTACTACTG ACGTAACCGG 
GGTTATGCCT GGTGAGAACG 
TTGCTATGGA AGAAGGTCTG 



CGAACTAGCT GCTGC^ITTGG 
TAGACAAACC CTTCCTGTTG 
CGTGGTACCG TAGTAACCGG 
TGAAGAGATC GAAATCGTTG 
CTGGCGTGGA AATGTTCCGC 
AACGTAGGCG TACTGCTGCG 
TCAAGTATTG GCTAAACCCG 
CCGAAGTATA CGTATTGAGC 
TTTGCCAACT ACCGTCCACA 
TGCTGTAGAG CTGGAGCCTG 
TAACCATCAC CGTAGAACTG 
CGCTTTGCGA 



ATAGCTACAT 3 00 

CCGATCGAAG 350 

TCGTGTAGAG 4 00 

GTCTGAAGCC 450 

AAACTGCTGG 500 

CGGTACCAAA 550 

GCACCATCAC 600 

AAAGAAGAAG 650 

GTTCTACTTC 700 

GTGTAGAAAT 750 

ATTGCTCCGA 800 
830 



15 



2) INFORMATION FOR SEQ ID NO: 49 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 808 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter aerogrenes 

(B) STRAIN: ATCC 13048 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

35 GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGACGA 2 00 

CACTCCGATC GTTCGTGGTT CTGCT CTGAA AG CGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCRGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

40 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGCGAAAA CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGYGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CAGCATCAAG 600 

CCGCACACCA AGTT CGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG 65 0 

45 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGRTG 75 0 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGG 80 8 



50 



2) INFORMATION FOR SEQ ID NO : 5 0 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

31 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter agglomerans 

(B) STRAIN: ATCC 27989 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGTGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ATACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAMGCW 250 

GAGTGGGAAG CG AAAAT CAT CGARCTGGCT GGCCACCTGG ATACCTATAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGCGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCA TYAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

TACYGCGAAA TCAACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCAT CAA 600 

20 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGTCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCSCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTCGCA 82 R 

25 



. 2) INFORMATION FOR SEQ ID NO: 51 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter awnigenus 
40 (B) STRAIN: ATCC 33072 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACGC 50 

45 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGTGATG 200 

ACACTCCAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 250 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTAC CTGG ATTCTTACAT 300 

50 CCCGGAACCA GAACGTGCTA TCGATAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC TATCTCCGGC CGTGGTACTG TTGTAACCGG TCGTGTAGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

55 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCAG GCTCAATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA TATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG CACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACA TTCAGATGGT TGTTACCCTG ATCCACCCAA 800 

60 TCGCGATGGA TGACGGTCTG • CGTTT 825 
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2) INFORMATION FOR SEQ ID NO: 52 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterobacter asburlae 

<B) STRAIN: ATCC 35953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

20 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTT CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ATACTCCAAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

25 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GAGCGTGCGA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTAC CTGT A CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

3 0 ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCGAAGCCAG GCTCAATCAA 600 

GCCACACACC AAGTT CGAAT CTGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 750 

35 GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACTCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTCTG CG 822 



4 0 2 ) INFORMATION FOR SEQ ID NO: 53 

( i ) SEQUENCE CHAF^CTERISTYcs": " 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cancerogenus 

(B) STRAIN: ATCC 3 5317 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 53 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AGTGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCAGGCGACG 200 

60 ACACTCCAAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAAGCT 250 
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GAGTGGGAAG CAAAAATCAT CGAACTGGCT 
CCCAGAACCA GAGCGTGCGA TTGACAAGCC 
ACGTATTCTC CATCTCCGGT CGTGGTACCG 
CGCGGTATCA TCAAAGTTGG TGAAGAAGTT 
5 TACTGCKAAA TCTACCTGTA CTGGCGTTGA 
ACGAAGGCCG TGCTGGTGAG AACGTTGGTG 
CGCGAAGAAA TCGAACGTGG TCAGGTTCTG 
GCCACACACC AAGTTCGAAT CTGAAGTGTA 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT 
10 CGTACAACTG ACGTGACCGG TACCATCGAA 
GGTAATGCCA GGCGACAACA TCAAGATGGT 
TCGCGATGGA CGACGGTCTG CGTTTC 



GGCTTCCTGG ATTCTTACAT 300 

ATTCCTGCTG CCAATCGAAG 350 

TTGTTACCGG TCGTGTAGAG 400 

GAAATCGTTG GTATCAAAGA 450 

AATGTT CCGC AAACTGCTGG 500 

TTCTGCTGCG TGGTATCAAA 550 

GCGAAGCCAG GCTCAATCAA 60 0 

CATCCTGTCC AAAGACGAAG 650 

ACCGTCCACA GTTCTACTTC 700 

CTGCCAGAAG GCGTAGAGAT 750 

TGTGACGCTG ATCCACCCAA 800 

826 



15 



25 



2) INFORMATION FOR SEQ ID NO: 54 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 806 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

GATCCTGGTA GTAGCTGCGA CTGACGGCCC AATGCCTCAG ACTCGTGAGC 50 
ACATCCTGCT GGGTCGTCAG GTAGGCGTTC CTTACATCAT CGTGTTCCTG 100 
AACAAATGCG ACATGGTTGA TGACGAAGAG CTGCTGGAAC TGGTAGAGAT 150 

3 5 GGAAGTTCGT GAACTGCTGT CTCAGTACGA TTTCCCAGGC GACGATACCC 200 

CAAT CGTTCG TGGTTCTGCT CTGAAAGCGC TGGAAGGCGA CGCAGAGTGG 250 
GAAGMGAAAA TCATCGAACT GGCTGGCTAC CTGGATTCTT ACATCCCAGA 3 00 

ACCAGAGCGT GCGATTGAYA AGCCATTCCT GCTGCCAATC GAAGACGTAT 350 
TCTCCATCTC CGGTCGTGGT ACCGTTGTTA CCGGTCGTGT AGAGCGCGGT 400 

4 0 ATCATCAAAG TGGGTGAAGA AGTTGAAATC GTTGGTATCA AAGAGACTGC 4 50 

GAAGTCTACC TGTACTGGCG TTGAAATGTT CCGCAAACTG CTGGACGAAG 500 

GCCGTGCTGG TGAGAACGTT GGTCTTCTGC~ TGCG^C^TAT ^A^CGTGAA " 55~0~~ 

GAAATCGAAC GTGGTCAGGT TCTGGCGAAG CCAGGCTCAA TCAAGCCACA 600 

CACCAAGTTC GAATCTGAAG TGTACATCCT GTCCAAAGAC GAAGGCGGCC 650 

GTCATACTCC GTTCTTCAAA GGCTACCGTC CACAGTTCTA CTTCCGTACA 700 

ACTGACGTGA CCGGTAC CAT CGAACTGCCA GAAGGCGTAG AGGTGGTAAT 750 

GCCAGGCGAC AACATCAAGA TGGTTGTGAC TCTGATCCAC CCAATCGCGA 800 

TGGACG 806 



2 ) INFORMATION FOR SEQ ID NO : 55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 3 3 028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 10 0 

TTCCTGAACA AGTGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 2 00 

ACACCCCGAT CGTTCGCGGT TCTGCGCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCCACCTGG ATACCTAYAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATTTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCA TCAAGGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

CACCGCGAAA ACCACCTGTA CTGGCGTTGA AATGTT CCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTCGGCG TTCTGCTGCG TGGTATCAAG 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCTCCATCAA 600 

GCCGCACACC AAGTT CGAAT CTGAAGTGTA CATCCTGTCC AAAGACGAAG 6 50 

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAGATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTC 82 6 



2 ) INFORMATION FOR SEQ ID NO : 56 

( i ) SEQUENCE CHARACTERI ST I CS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 

(B) STRAIN: ATCC 49162 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTAC ATCATCGTGT 10 0 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CAGGCGACGA 200 

CACCCCAATC GTTCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAMGCAG 250 

AGTGGGAAGM GAAAATCATC GARCTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

CCAGAACCAG AG CGTGCGAT TGACAAGCCA TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTWGAGC 40 0 

GCGGTATCAT CAAAGTAGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CGAAGCCAGG CTCAATCAAG 600 

CCACACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGACAACAT CAAGATGGTT GTGACGCTGA TCCACCCAAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 

35 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 57 

5 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterobacter sakazakll 

(B) STRAIN: ATCC 29544 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

20 GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TGCGCGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACCCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCTG 250 

25 AGTGGGAAGC GAAAAT CATC GAGCTGGCAG GTCACCTGGA TTCCTACATC 3 00 

CCGGAACCGG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCYGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTTGGT GAAGAAGTTG AAATCGTGGG CATCAAAGAC 450 

ACCGCGAAAT CCACCTGTAC CGGCGTTGAA ATGTTCCGCA AAC TGCTGGA 500 

30 CGAAGGCCGT GCGGGCGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACRACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTTGAGATG 750 

35. GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAAT C 831 



40 2) INFORMATION FOR SEQ ID NO: 58 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN: ATCC 25788 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 



CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GTGT AC CAT A CATCGTTGTT 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATATGACTTC CCAGGCGACG 200 

60 ATGTTCCTGT AATCGCTGGT TCTGCTTTGA AAGCTCYTGA AGGCGATGCT 250 

36 
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TCATACGAAG AAAAAATCAT GGAATTAATG 
TCCAACTCCA GAACGTGACA CTGACAAACC 
ACGTATTCTC AATCACTGGA CGTGGTACTG 
CGTGGACAAG TTCGCGTTGG TGACGAAGTT 
5 AGAAACTGCT AAAACAACTG TAACTGGTGT 
TAGACTATGC TGAAGCAGGG GATAACATTG 
GCTCGTGAAG ACATCCAACG TGGACAAGTA 
CACACCTCAT ACAAAATTTA AAGCTGAAGT 
AAGGTGGACG TCACACACCA TTCTTCACTA 
10 TTCCGTACAA CTGACGTAAC TGGTGTTGTT 
AATGGTTATG CCTGGTGATA ACGTAACAAT 
CAATCGCTAT CGAAGACGGA ACTCGTTTCT 



GCTGCAGTTG ACGAATACGT 3 00 

ATTCATGATG CCAGTCGAAG 3 50 

TTGCTACAGG CCGTGTTGAA 4 00 

GAAATCGTTG GTATTGCTGA 450 

TGAAATGTTC CGTAAATTGT 5 00 

GTGCATTGCT ACGTGGTGTT 550 

TTGGCTAAAG CTGGTACAAT 600 

TTACGTTTTA ACAAAAGAAG 650 

ACTACCGTCC TCAGTTCTAC 700 

GAATTACCAG AAGGAACTGA 750 

CGACGTTGAA TTGATCCACC 8 00 

CAATT 835 



15 



25 



2) INFORMATION FOR SEQ ID NO: 59 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 826 bases 
20 <B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

GGTGCTATCT TAGTAGTATC TGCTGCTGAT GGTCCTATGC CACAAACTCG 50 

TGAACACATT CTTTTATCAC GTAACGTTGG TGTTCCATAC ATCGTTGTTT 100 

TCTTAAACAA AGTTGATATG GTTGACGACG AAGAATTATT AGAATTAGTT 150 

35 GAAATGGAAG TACGTGACTT ATTAACTGAA TACGACTTCC CAGGAGACGA 200 

TGTTCCTGTA ATCGCTGGTT CTGCATTAAA AGCTTTAGAA GGCGACCCAT 250 

CTTACGAAGA AAAAATCTTA GAATTAATGG CTGCAGTTGA CGAATACATC 300 

CCAACTCCAG AACGTGACAA CGATAAACCA TTCATGATGC CAGTCGAAGA 350 

CGTATTTTCA ATCACTGGTC GTGGTACTGT TGCTACAGGT CGTGTTGAAC 400 

40 GTGGACAAGT ACGTGTTGGT GACGAAGTTG AAATAGTTGG TATCCATGAT 450 

GAAATTTCTA AAACAACAGT TACTGGTGTT GAAATGTTCC GTAAATTATT 500 

AGATTACGCT GAAGCTGGAG ACAACATCGG TGCATTATTA CGTGGTGTGG 550*"" 

CTCGTGAAGA TATCCAACGT GGTCAAGTAT TAGCTAAACC AGGTTCAATC 600 

ACTCCACATA CAAAATTCAC TGCTGAAGTG TACGTTTTAA CTAAAGAAGA 650 

45 AGGTGGACGT CATACTCCAT TCTTCACTAA CTACCGTCCA CAATTCTACT 700 

TCCGTACAAC TGACGTTACA GGTGTAGTTA ACTTACCAGA AGGTACTGAA 750 

ATGGTTATGC CTGGTGATAA CGTAACTATG GAAGTTGAAT TAATCCACCC 800 

AATCGCTATC GAAGACGGAA CTCGTT 826 



50 



2) INFORMATION FOR SEQ ID NO : 60 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

37 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 



CGGCGCGATC TTGGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CCTATTGTCA CGTAACGTTG GTGTTCCTTA CATCGTCGTT 100 

10 TTCTTGAACA AAATGGACAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATACGACTTC CCAGGCGACG 2 00 

ACACTCCAGT TATCGCAGGT TCAGCTTTGA AAGCCTTAGA AGGCGACGCT 250 

TCATATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 300 

CCCAACTCCA GTTCGTGATA CTGACAAACC ATTCATGATG CCAGT CGAAG 350 

15 ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCAACTGG TCGTGTTGAA 4 00 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTAG GTATCGCTGA 450 

AGAAACTGCT AAAACTACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TGGATTACGC TGAAGCTGGC GACAACATTG GTGCATTATT ACGTGGTGTG 550 

GCTCGTGAAG ATATC CAACG TGGTCAAGTA TTATCAAAAC CAGGTTCAAT 600 

20 CACTCCACAT ACAAAATTTG CGGCAGAAGT TTACGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGCCC ACAATTCTAC 7 00 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACGTTACTAT GGACGTTGAA TTAATCCACC 800 

CAATCGCGAT CGAAGACGGT ACTCGTTTCT CAATC 835 

25 



2 ) INFORMATION FOR SEQ ID NO: 61 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: JSntejro coccus duraxis 
40 (B) STRAIN: ATCC 19432 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 



CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGCCCTATG CCTCAAACTC 50 

45 GTGAACATAT CCTATTATCT CGTCAAGTTG GTGTTCCTTA CATCGTYGTA 100 

TTCTTGAACA AAGTAGATAT GGTCGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACAGA ATACGAATTC CCTGGTGACG 200 

ATGTTCCTGT AAT CGCTGGT TCAGCTTTGA AAGCTTTAGA AGGCGACGCT 25 0 

TCATACGAAG AAAAAATCCT TGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

50 CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG 3 50 

ATGTATTCTC RATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGTTGTA GATAT CGTTG GTATCGCAGA 450 

AGAAACAGCT CAAACAACAG * TTACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGRCTACGC TGAAGCTGGA GACAACATTG GTGCTTTACT ACGTGGTGTT 550 

55 GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACKCCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 7 00 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

60 CAATCGCTAT CGAAAATGGT ACTAAATTCT CAATC 83 5 

38 
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2) INFORMATION FOR SEQ ID NO: 62 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
!5 (A) ORGANISM: Enterococcus fa.eca.lis 

(B) STRAIN: R610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

20 AGTAGTTTCT GCTGCTGATG GTCCTATGCC TCAAACACGT GAACATATCT 50 

TATTATCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 100 

ATGGATATGG TTGATGACGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTGACTTA TTATCAGAAT ACGATTTCCC AGGCGATGAT GTTCCAGTTA 200 

TCGCAGGTTC TGCTTTGAAA GCTTTAGAAG GCGACGAGTC TTATGAAGAA 25 0 

25 AAAATCTTAG AATTAATGGC TGCAGTTGAC GAATATATCC CAACTCCAGA 3 00 

ACGTGATACT GACAAACCAT TCATGATGCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGACG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTGAAGTT 400 

CGCGTTGGTG ACGAAGTTGA AATCGTTGGT ATTAAAGACG AAACATCTAA 450 

AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 500 

30 AAGCAGGCGA CAACATCGGT GCTTTATTAC GTGGTGTAGC ACGTGAAGAT 550 

ATCGAACGTG GACAAGTATT AGCTAAACCA GCTACAATCA CTCCACACAC 600 

AAAATTCAAA GCTGAAGTAT ACGTATTATC AAAAGAAGAA GGCGGACGTC 650 

ACACTCCATT CTTCACTAAC TACCGTCCTC 680 



35 



50 



2) INFORMATION FOR SEQ ID NO : 63 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 680 bases 

CB) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: R487 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



AGTAGTTTCT GCTGCTGATG GTCCTATGCC TCAAACACGT GAACATATCT 50 

TATTATCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 100 

55 ATGGATATGG TTGATGACGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTGACTTA TTATCAGAAT ACGATTTCCC AGGCGATGAT GTTCCAGTTA 20 0 

TCGCAGGTTC TGCTTTGAAA GCTTTAGAAG GCGACGAGTC TTATGAAGAA 250 

AAAATCTTAG AATTAATGGC TGCAGTTGAC GAATATATCC CAACTCCAGA 300 

ACGTGATACT GACAAACCAT TCATGATGCC AGTCGAAGAC GTATTCTCAA 350 

60 TCACTGGACG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTGAAGTT 400 

39 
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CGCGTTGGTG ACGAAGTTGA AATCGTTGGT 
AACAACTGTT ACAGGTGTTG AAATGTTCCG 
AAGCAGGCGA CAACATCGGT GCTTTATTAC 
ATCGAACGTG GACAAGTATT AGCTAAACCA 
5 AAAATTCAAA GCTGAAGTAT ACGTATTATC 
ACACTCCATT CTTCACTAAC TACCGTCCTC 



ATTAAAGACG AAACATCTAA 450 
TAAATTATTA GACTACGCTG 500 
GTGGTGTAGC ACGTGAAGAT 550 
GCTACAATCA CTCCACACAC 600 
AAAAGAAGAA GGCGGACGTC 650 

680 



10 2) INFORMATION FOR SEQ ID NO: 64 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entsrococcus faeclum 

(B) STRAIN: R4 82 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 



AGTAGTTTCT GCTGCTGACG GCCCAATGCC TCAAACTCGT GAAC ACATC C 50 

TATTGTCTCG TCAAGTTGGT GTTCCTTACA TCGTTGTATT CTTGAACAAA 10 0 

GTAGACATGG TTGATGACGA AGAATTACTA GAATTAGTTG AAATGGAAGT 150 

TCGTGACCTA TTAACAGAAT ACGAATTCCC TGGTGACGAT GTT CCTGTAG 200 

30 TTGCTGGATC AGCTTTGAAA GCTCTAGAAG GCGACGCTTC ATACGAAGAA 250 

AAAATTCTTG AATTAATGGC TGCAGTTGAC GAATACATCC CAACTCCAGA 3 00 

ACGTGACAAC GACAAACCAT TCATGATGCC AGTTGAAGAC GTGTTCTCAA 350 

TTACTGGACG TGGTACTGTT GCTACAGGTC GTGTTGAACG TGGACAAGTT 400 

CGCGTTGGTG ACGAAGTTGA AGTTGTTGGT ATTGCTGAAG AAACTTCAAA 450 

35 AACAACAGTT ACTGGTGTTG AAATGTTCCG TAAATTGTTA GACTACGCTG 50 0 

AAGCTGGAGA CAACATTGGT GCTTTACTAC GTGGTGTTGC ACGTGAAGAC 550 

ATCCAACGTG GACAAGTTTT AGCTAAACCA GGTACAATCA CACCTCATAC 600 

AAAATTCTCT GCAGAAGTAT ACGTGTTGAC AAAAGAAGAA GGTGGACGTC 650 

ATACTCCATT CTTCACTAAC TACCGTCCTC AATTT 685 

40 



2) INFORMATION FOR SEQ ID NO: 65 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flave&cans 

(B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTAYG CCTCAAACAC 50 
GTGAACACAT CTTGTTATCA CGTAACGTTG GTGTAC CATA CATCGTTGTT 100 

40 
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10 



15 



TTCTTAAACA 
TGAAATGGAA 
ATGTTCCTGT 
TCATACGAAG 
TCCAACTCCA 
ACGTATTCTC 
CGTGGACAAG 
AGAAACTGCT 
TAGACTATGC 
GCTCGTGAAG 
CACACCTCAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCTAT 



AAATGGATAT 
GTTCGTGACT 
AATCGCTGGT 
AAAAAATCAT 
GAACGTGACA 
AATCACTGGA 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGG 
ACATCCAACG 
ACAAAATTTA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
CGAAGACGGA 



GGTTGATGAC 
TATTGTCAGA 
TCTGCTTTGA 
GGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TAACTGGTGT 
GATAACATTG 
TGGACAAGTA 
AAGCTGAAGT 
TTCTTCACTA 
TGGTGTTGTT 
AMGTAACAAT 
ACTCG 



GAAGAATTAC 
ATATGACTTC 
AAGCTCTTGA 
GCTGCAGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGCATTGCT 
TTAGCTAAAG 
TTACGTTTTA 
ACTACCGTCC 
GAATTACCAG 
CGACGTTGAA 



TAGAATTAGT 150 

CCAGGCGACG 2 00 

AGGCGATGCT 250 

ACGAATACGT 300 

CCAGTCGAAG 350 

CCGTGTTGAA 400 

GTATTGCTGA 450 

CGTAAATTGT 500 

ACGTGGGGTT 550 

CTGGTACAAT 600 

ACAAAAGAAG 650 

TCAGTTCTAC 700 

AAGGAACTGA 75 0 

TTGATCCACC 800 
82 5 



20 



2 ) INFORMATION FOR SEQ ID NO: 66 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Enterococcus galllnarum 

(B) STRAIN: R420 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

35 TCCTATGCCT CAAACTCGTG AACACATCTT GTTATCACGT AACGTTGGCG 50 
TACCATACAT CGTTGTTTTC TTGAACAAAA TGGATATGGT TGATGACGAA 100 
GAATTGCTAG AATTAGTTGA AATGGAAGTT CGTGACCTAT TGTCTGAGTA 150 
TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCCGGTTCT GCTTTGAAAG 20 0 

CTCTTGAAGG AGATCCTTCA TACGAAGAAA AAATCATGGA ATTGATGGCT 250 
40 GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGATACTG ACAAACCATT 3 00 

CATGATGCCA GTCGAAGACG TATTCTCAAT CACTGGACGT GGTACTGTTG 350 

CTACAGGCCG TGTTGAACGT GGACAAGTTC GCGTTGGTGA TGAAGTAGAA 4 00 

ATCGTTGGTA TTGCTGACGA AACTGCTAAA ACAACTGTAA CAGGTGTTGA 4 50 

AATGTTCCGT AAATTGTTAG ACTATGCTGA AG C AGGGG AT AACATTGGTG 50 0 

45 CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG ACAAGTATTG 550 

GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTCAAAG CTGAAGTTTA 60 0 

TGTTTTGACA AAAGAAGAAG GTGGACGTCA CACTCC 63 6 



50 

2) INFORMATION FOR SEQ ID NO: 67 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 835 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 8043 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACATAT CCTAYTATCT CGTCAAGTTG GTGTTCCATA CATCGTTGTA 10 0 

TTCTTGAACA AAGTAGATAT GGTTGACGAC GAAGAATTAC TAGAATTAGT 150 

10 TGAAATGGAA GTTCGTGACT TATTAACAGA ATACGAATTC CCTGGTGACG 2 00 

ATGTTCCTGT AGTTGCTGGT YCAGCTTTGA AAGCTTTAGA AGGCGACGCT 2 50 

TCATACGAAG AAAAAATCCT TGAATTGATG GCTGCAGTTG ACGAATATAT 300 

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTCGAAG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 4 00 

15 CGTGGACAAG TTCGCGTTGG TGACGTTGTA GATATCGTTG GTATCGCAGA 450 

AGAAACAG CT CAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGA GACAACATTG GTGCTTTACT ACGTGGTGTT 5 50 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 65 0 

20 AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 70 0 

TTC CGTACRA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAAACGGT ACTAAATT CT CAATC 83 5 



25 



40 



2) INFORMATION FOR SEQ ID NO : 6 8 



( i) SEQUENCE CHARACTERISTICS : 
3 0 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 



CGGAGCAATC TTAGTTGTTT CTGCTGCTGA CGGCCCTATG CCTCAAACTC 50 

GTGAACACAT CCTATTATCT CGTCAAGTTG GTGTACCATA CATCGTTGTA 100 

45 TTCTTGAACA AAGTAGATAT GGTTGATGAC GAAGAATTAC TTGAATTAGT 150 

TGAAATGGAA GTTCGTGACC TATTAACAGA ATACGAATTC CCTGGTGACG 200 

ATGTTCCTGT AATCGCTGGT TCAGCTTTAA GAGCTTTAGA AGGCGACGCT 250 

KCATACGAAG AAAAAATTCT TGAATTGATG GCTGCAGTTG ACGAATATAT 3 00 
CCCAACTCCA GAACGTGATA ACGACAAACC ATTCATGATG CCAGTTGAGG - 35 0 

50 ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG ACGTGTTGAA 400 

CGTGGACAAG YTCGTGTTGG TGACGTTATC GATATCGTTG GTATCGCAGA 450 

AGAAACAGCT CAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCAGGC GATAACATTG GTGCGTTACT ACGTGGTGTT 550 

TCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTAAAC CAGGTACAAT 600 

55 CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700 

TTCYGTACGA CTGACGTAAC TRGTGTTGTY GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAAATGGT ACTAAATTCT CAATC 83 5 



60 
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2) INFORMATION FOR SEQ ID WO: 69 



10 



15 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 4 93 72 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 



CGGAGCTATC 

20 GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ACACTCYAGT 
TCATACRAAG 

25 CCCAACACCA 
ACGTATTCTC 
CGTGGACAAG 
AGAAACTGCT 
TAGACTACGC 

30 GCACGTGAAG 
CACTCCACAT 
AAGGCGGGCG 
TTCCGTACAA 
AATGGTAATG 

35 CAATCGCGAT 



TTAGTAGTAT 
CTTGTTATCT 
AAATGGATAT 
GTTCGTGACT 
TAT CGCTGGT 
AAAAAATCTT 
GTTCGTGATA 
AAT CACTGGT 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGC 
ACATCCAACG 
ACAAAATTCT 
TCACACTCCG 
CTGACGTAAC 
CCTGGTGATA 
CGAAGACGGA 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGATGAC 
TATTGTCAGA 
TCAGCYTTGA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACAGGTGT 
GATAACATCG 
TGGACAAGTA 
CTGCAGAAGT 
TTCTTCACTA 
TGGTGTTGTT 
ACGTAACTAT 
ACTCGTTTCT 



TGGCCCTATG 
GTGTTCCTTA 
GAAGAATTAC 
ATACGATTTC 
AAGCTTTAGA 
SCTGCTGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTAG 
TGAAATGTTC 
GTGCATTATT 
TTGGCTAAAC 
TTACGTTTTA 
ACTACCGTCC 
GATCTACCAG 
GGAAGTTGAA 
CTATTC 



CCTCAAACAC 50 

CATCGYTGTA 100 

TAGAATTAGT 150 

CCAGGCGACG 200 

AGGCGACCCT 250 

ACGAATACAT 3 00 

CCAGTCGAAG 350 

TCGTGTTGAA 400 

GTATCGCTGA 450 

CGTAAATTGT 500 

ACGTGGTGTT 550 

CAGCTTCAAT 600 

ACTAAAGAAG 650 

TCAGTTCTAC 700 

AAGGTACTGA 750 

TTAATCCACC 80 0 
836 



40 



2) INFORMATION FOR SEQ ID NO: 7 0 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 4 9427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

55 CGGAGCTRTC TTAGTAGTAT CTGCTGCTGA TGGCCCTATG CCTCAAACTC 50 

GTGAACACAT CTTGTTATCT CGTAACGTTG GTGTTCCTTA CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACTGA ATACGACTTC CCAGGCGACG 200 

ACACTCCAGT TATCGCAGGT TCAGCTTTGA AAGCCTTAGA AGGCGACGCT 250 

60 TCATACGAAG AAAAAATCTT AGAATTAATG GCTGCTGTTG ATGAATACAT 300 
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CCCAACACCA GTTCGTGATA CTGACAAACC 
ACGTAYTCTC AATCACTGGT CGTGGAACTG 
* CGTGGACAAG TTCGCGTTGG TGACGAAGTT 
AGAAACTGCT AAAACAACTG TTACAGGTGT 
5 TGGATTACGC TGAAGCGGGC GACAACATTG 
GCACGTGAAG ACATCCAACG TGGACAAGTA 
CACTCCACAT ACAAAATTCT CTGCAGAAGT 
AAGGCGGACG TCATACTCCA TTCTTCACTA 
TTC CGTACAA CTGACGTAAC TGGTGTAGTT 
10 AATGGTAATG CCTGGTGATA ACGTAACTAT 
CAATCGCGAT CGAAGACGGA ACTCGTTTCT 



ATTCATGATG CCAGYGGAAG 3 50 

TTGCAACTGG TCGTGTTGAA 4 00 

GAAATCGTAG GTATTGCTGA 450 

TGAAATGTTC CGTAAATTGT 5 00 

GTGCATTATT ACGTGGTGTT 550 

TTGGCTAAAC CAGCTTCAAT 600 

TTACGTTTTA ACTAAAGAAG 650 

ACTACCGTCC TCAGTTCTAC 7 00 

GATCTACCAG AAGGTACTGA 750 

GGAAGTTGAA TTAATCCACC 8 00 

CTATT 835 



15 2 ) INFORMATION FOR SEQ ID NO: 71 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DMA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43 076 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 

30 

CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

. GTGAACACAT CTTGTTATCT CGTAACGTAG GTGTTCCTTA CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCAGA ATACGATTTC CCAGGCGATG 2 00 

35 ACACTCCAGT TATTGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGATCCA 250 

GTTTACGAAG AAAAAATCTT CGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA GAACGTGATA CTGAAAAACC ATTCATGATG CCAGTTGAGG 350 

ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 4 00 

CGTGGACAAG TTCGCGTTGG TGACGTTGTA GAAATCGTTG GTATCGACGA 4 50 

40 AGAAACAGCT CAAACTACTG TAACAGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCAGGC GATAACATCG GTGCTTTATT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTAGCTAAAC CAGGAACAAT "~ 6 0 b" " 

CACTCCTCAT ACAAAATTCG TAGCTGAAGT TTACGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC 7 00 

45 TTCCGTACAA CTGACGTAAC TGGTGTTGTA GAATTACGCG AAGGTACTGA 750 

AATGGTAATG CCTGGTGACA ACGTAACTAT CGACGTTGAA TTAATCCACC 8 00 

CAATCGCTAT CGAAGACGGA ACTCGTTTCT CTATT 835 



50 

2) INFORMATION FOR SEQ ID NO: 72 

{ i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 823 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus solitarlus 

(B) STRAIN: ATCC 4 9428 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

GAGCTATCTT GGTAGTTTCT GCAGCTGATG GCCCAATGCC ACAAACTCGT 50 

GAACATATTT TGTTGTCACG TAATGTAGGT GTACCTTACA TCGTTGTGTT 100 

CTTGAACAAA ATGGATATGG TTGATGACGA AGAATT AC TT GAGTTAGTTG 150 

10 AAATGGAAGT ACGTGATCTA TTATCTGAAT ACGACTTCCC AGGAGATGAT 2 00 

ACTCCAGTTA TTTCCGGTTC AGCTTTGAAA GCTTTAGAAG GCGACGAAGA 250 

ATATGAACAA AAAATTATGG ACTTAATGGA TGCAGTTGAT GACTACATTC 3 00 

CAACTCCTGA ACGTGACCAT GACAAACCAT TCATGATGCC AATTGAAGAT 350 

GTATTTTCAA TTACAGGCCG TGGTACTGTT GCTACAGGAC GTGTTGAACG 4 00 

15 CGGGACTATC AAAGTCGGCG ATGAAGTTGA CATTATTGGT ATTCATGAAG 450 

ACGTTAAAAA GACAACAGTT ACTGGTGTAG AAATGTTCCG TAAATTGTTG 500 

GACTACGCTG AAGCAGGCGA TAACATTGGT ACTTTGTTAC GTGGTGTTTC 550 

TCGTGATGAT ATCGAACGTG GTCAAGTATT AGCTAAACCA GGTTCAATCA 60 0 

CACCACATAC AAGATTCTCT GCTGAAGTTT ATGTTTTGAC TAAAGAAGAA 650 

20 GGCGGACGTC ATACTCCATT CTTCTCAAAC TATCGTCCTC AATTC TACTT 7 00 

CCGTACAACT GATATCACTG GTGTCATTGA ATTGCCAGAA GGTACTGAAA 750 

TGGTAATGCC AGGTGATAAT GTAACAATGG ATGTTGAATT AATCCACCCA 800 

GTCGCTATCG AAGAAGGAAC TCG 82 3 



25 



40 



2) INFORMATION FOR SEQ ID NO: 73 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 



CGGTGCAATC TTGGTCGTAT CAGCGACAGA TGGCCCAATG CCACAAACAC 50 

GGGAGCATAT TTTGCTTTCT CGTCAAGTGG GTGTGAAACA TTTGATCGTC 10 0 

45 TTTTTGAATA AGACGGACCT TGTCGATGAT GACGAGTTGA TCGATTTAGT 150 

TGAAATGGAA GTCAGAGAAT TGCTGACTGA ATATGATTTT CCTGGCGACG 2 00 

ACATTCCTGT GATCAAGGGC TCTGCGTTAA AAGCCTTGGA AGGGGACCCA 250 

GATGCTGAAG CAGCGATCTT AACGCTGATG GATACAGTAG ATGAATATAT 30 0 

CCCAACGCCA GAAC GTGATA CTGACAAACC ATTGTTGTTA CCGATCGAAG 350 

50 ATGTCTTTTC GATCACAGGA CGGGGGACCG TTGCTTCTGG TCGGATCGAT 40 0 

CGCGGCATGG TAAAAGTCGG GGATGAAGTA GAAATCGTCG GAATCAAACC 45 0 

TGAAACACAA AAAGCAGTCG TGACAGGGGT AGAAATGTTC CGCAAAACGA 50 0 

TGGACTTCGG AGAAGCTGGC GATAACGTAG GGGTATTGTT ACGGGGCATC 550 

ACCCGTGATG AAATTGAACG TGGCCAAGTG TTAG CAAAAC CAGGTTCTAT 600 

55 CACACCGCAT ACGAAATTCC AAGCGGAAGT CTATGTGTTG ACAAAAGAAG 650 

AAGGCGGTCG CCATACCCCA TTCTTTAATA ATTATCGCCC ACAATTTTAC 700 

TTCCGTACAA CGGACGTAAC TGGGAATATC GTTTTACCAG AAGGAACGGA 750 

AATGGTGATG CCTGGTGACA ACGTAACGAT CGATGTGGAA TTGATCCATC 80 0 

CGATCGCTGT AGAAAATGGA ACGACCTTCT CGATT 835 

60 
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2) INFORMATION FOR SEQ ID NO: 74 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 
15 (B) STRAIN: ATCC 153 05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

TAACGGGCGT CTCGATAGAA AAACACGTGA AAATCCCAAT GATTATAAAC 50 

20 AATCAATATA CGATTTTGCT GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 100 

ACAAATAAAA ATTAATAGGC AACTTAACCA GAATCGTTAA AACTATATGA 150 

AGATTCTGGT TTTTTAAATT CAAAAAGTTT TCTAAAAAAT TTACTTGCTT 2 00 

TTTTAAGTAT AGGTATAAAA TACGATTGAT TAAAACAGTA AAGGAAATGA 250 

ATCATGAAAC AATTAACTAA GCCTTTATAC TTTTAC CTAT TACTTTTTAT 300 

25 TACAACAACG CTGATTGGCG CGTTACTATT ATATTTGCCA ATCACAGGTA 350 

AACATCCTAT TGATTTTGTG GACGCCCGTT 380 



30 2) INFORMATION FOR SEQ ID NO: 75 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) ' STRAIN: ATCC 4 9996 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

GCAATCTTGG TCGTATCAGC GACAGATGGC CCAATGCCAC AAACACGGGA 50 

GCATATTTTG CTTTCTCGTC AAGTGGGTGT GAAACATTTG ATCGTCTTTT 100 
TGAATAAGAC GGACCTTGTC GATGATGACG AGTTGATCGA TTTAGTTGAA • 150 

ATGGAAGTCA GAGAATTGCT GACTGAATAT GATTTTCCTG GCGACGACAT 2 00 

TCCTGTGATC AAGGGCTCTG CGTTAAAAGC CTTGGAAGGG GACCCAGATG 250 

CTGAAGCAGC GATCTTAACG CTGATGGATA CGGTAGATGA ATATATCCCA 3 00 

ACGCCAGAAC GTGATACTGA CAAACCATTG TTGTTAC CGA TCGAAGATGT 350 

CTTTTCGATC ACAGGACGGG GGACCGTTGC TTCTGGTCGG ATCGATCGCG 400 

GCATGGTAAA AGTCGGGGAT GAAGTAGAAA TCGTCGGAAT CAAACCTGAA 450 

ACACAAAAAG CAGTCGTGAC AGGGGTAGAA ATGTTCCGCA AAACGATGGA 500 

CTTCGGAGAA GCTGGCGATA ACGTAGGGGT ATTGTTACGG GGCATCACCC 550 

GTGATGAAAT TGAACGTGGC CAAGTGTTAG CAAAACCAGG TTCTATCACA 600 

CCGCATACGA AATTCCAAGC GGAAGTCTAT GTGTTGACAA AAGAAGAAGG 650 

CGGTCGCCAT ACCCCA 666 

46 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



2) INFORMATION FOR SEQ ID NO: 76 

{ i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gall in arum 
15 (B) STRAIN: ATCC 49573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 

TGGTGCGATT TTAGTTGTAT CCGCAACAGA TGGTCCAATG CCTCAAACCC 50 

20 GGGAACATAT CTTGCTTTCG AGACAAGTTG GTGTGAAACA TCTGATTGTT 100 

TTCTTGAACA AAATCGATTT AGTCGATGAC GAAGAATTGA TTGATTTAGT 150 

AGAAATGGAA GTAAGAGAAC TGCTATCTGA ATATAATTTT CCAGGGGATG 2 00 

ACATTCCTGT TAT CAAAGGT TCGGCATTAA AAGCGTTGGA AGGAGACCCT 250 

GATGCAGAAG CTGCCATCAT GGAATTAATG GATACAGTAG ACAGCTATAT 3 00 

25 CCCAACACCT GAGCGTGATA CAGACAAACC ATTACTCTTG CCAGTTGAAG 3 50 

ATGTCTTTTC GATTACTGGA CGAGGAACAG TTGCTTCCGG ACGGATCGAT 4 00 
CGGGGAACAG TTCGGGTAGG CGATGAAGTA GAAATCGTCG GTATCAAACC • 450 

TGAAACCCAA AAAGCTGTAG TGACAGGCGT CGAAATGTTC CGCAAGACGA 50 0 

TGGACTTTGG GGAAGCCGGT GACAATGTAG GTGTCTTGCT GAGAGGGATC 550 

3 0 ACTCGTGACG AAATTGAACG AGGACAAGTG TTGGCTAAAC CAGGTTCGAT 6 00 

CACACCACAT ACAAAATTCC AAGCAGAAGT TTATGTATTG ACGAAAGAAG 650 

AAGGTGGTCG TCATACACCA TTCTTCAACA ACTATCGTCC ACAATTTTAT 700 

TTCCGTACAA CGGATGTGAC AGGGAACATT ACATTGCCTG AAGGAACAGA 750 
A 



35 



751 



2) INFORMATION FOR SEQ ID NO: 77 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia canis 
SO (B) STRAIN: Florida 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 

TGCAGCAATA TTAGTAGTGT CTGCAACTGA TGGAGCAATG CCACAAACAA 50 

55 GAGAACATAT ATTATTAGCA AAGCAAGTAG GTGTAAAAGA TATAGTAGTG 100 

TGGATGAATA AGTGTGATGT TGTAGATGAT GAAGAAATGT TGTCATTAGT 150 

TGAAATGGAA ATAAGGGAAT TGTTATCAAA ATATGGGTAT CCTGGGGATG 200 

ATATAGATGT AGTTAGAGGA TCTGCAGTTA AAGCATTAGA AGAAGAAACA 250 

GGCTCAGGTG TGTGGAGTGA AAAAATAATG GAATTGATGA ATGCTTTAGA 3 00 

60 AAAAATAAGT TTACCAGTAA GAGAAAAAGA TAAGCCATTT TTAATGTCAA 350 
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TAGAAGATGT GTTTTCAATA CCTGGAAGAG 
ATAGAAAGAG GAGTAATTAG AGTAGGGGAT 
GCGTGAGATA CAAAGTACAG TATGTACAGG 
CATTAGATGC AGGAGAAGCA GGGGATAATG 
5 ATAAAAAAAG AAGATGTAGA AAGAGGGCAA 
GATACATTCA TATAAGAGAT TTAAGGCAGA 
AAGAAGGAGG AAGACATACT CCATTTTTCT 
TATGTTAGAA CAACAGATGT AACAGGGAAT 
AGAAATGGTA ATGCCAGGGG ATAATATAAA 
10 AGCCTGTTGC TATTGATCAA GGATTGAGAT 



GTACAGTAGT AACAGGAAGA 4 00 

AAAATAGAGA TAGTAGGATT 4 50 

TGTTGAAATG TTTCATAAAG 500 

CTGGAATATT GTTAAGAGGG 550 

GTATTGAGTG CACCTGGACA 600 

GGTATATATA TTGAAAAAAG 650 

CAAATTACCA GCCGCAATTT 700 

ATAAAGTTAC CAGAAGGAGT 750 

TATCGAAGTG AGTTTGGATA 800 

TTGC 834 



2) INFORMATION FOR SEQ ID NO: 78 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 23511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

ACACTCCGAT CGTT CGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

35 GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTAYAT 3 00 

TCCGGAACCA GAG CGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

40 ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG" ~ 650~ " 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

45 GGTAATGCCG • GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGT 817 



15 



20 



25 



30 



50 2) INFORMATION FOR SEQ ID NO: 79 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia, fergusonii' 

(B) STRAIN: ATCC 35469 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

5 

CGATCCTGGT AGTTGCTGCG ACTGACGGCC CGATGCCGCA GACTCGTGAG 50 

CACATCCTGC TGGGTCGTCA GGTAGGCGTT CCGTACATCA TCGTGTTCCT 100 

GAACAAGTGC GACATGGTTG ATGACGAAGA GCTGCTGGAA CTGGTTGAAA 150 

TGGAAGTTCG TGAACTTCTG TCTCAGTACG ACTTCCCGGG CGACGACACT 200 

10 CCGATCGTTC GTGGTTCTGC TCTGAAAGCG CTGGAAGGCG ACGCAGAGTG 250 

GGAAGCGAAA ATCCTGGAAC TGGCTGGCTT CCTGGATTCT TACATTCCGG 3 00 

AACCAGAGCG TGCGATTGAC AAGCCGTTCC TGCTGCCGAT CGAAGACGTG 35 0 

TTCTCCATCT CCGGTCGTGG TACCGTTGTT ACCGGTCGTG TAGAACGCGG 400 

TATCATCAAA GTTGGTGAAG AAGTTGAAAT CGTTGGTATC AAAGAGACTC 450 

15 AGAAGTCTAC CTGTACTGGC GTTGAAATGT TCCGCAAACT GCTGGACGAA 5 00 

GGCCGTGCTG GTGAGAACGT AGGTGTTCTG CTGCGTGGTA TCAAACGTGA 550 

AGAAATCGAA CGTGGTCAGG TACTGGCTAA GCCGGGCACC ATCAAGCCGC 600 

ACACCAAGTT CGAATCTGAA GTGTACATTC TGTCCAAAGA TGAAGGCGGT 650 

CGTCATACTC CGTTCTTCAA AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 7 00 

20 TACTGACGTG ACTGGTACCA TCGAACTGCC GGAAGGCGTA GAGATGGTAA 750 

TGCCGGGCGA CAACATCAAA ATGGTTGTTA CCCTGATCCA CCCGATCGCG 8 00 

ATGGACGACG GTCTGCGTTT CGCAA 82 5 



25 



35 



2) INFORMATION FOR SEQ ID NO : 8 0 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 9 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 

GGCGCGATCC TGGTTGTTGC" TGCGATTGAC GGCCCGA^GCf CGCA^A^CCgT " 50~ 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

45 GAGATGGAAG TTCGCGAACT GCTGTCCCAG TACGATTTCC CGGGCGACGA 2 00 

CACCCCGATC GTTCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 

AGTGGGAAGA GAAAATCATC GAACTGGCTG GCTACCTGGA TTCCTATATC 300 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 350 

CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

50 GCGGTATCAT CAAAGTGGGT GAAGAAGTTG AAATCGTGGG TATCAAAGAT 450 

ACTGCGAAAT CAACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG TTCCATCAAG 60 0 

CCKCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

55 CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA C GTGACTGGC ACCATCGAAC TGCCGGAAGG CGTTGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 

60 
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2 ) INFORMATION FOR SEQ ID NO : 81 

( i ) SEQUENCE CHARACTERISE CS : 
5 (A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 { i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Escherichia, vulneris 

(B) STRAIN: ATCC 33821 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

20 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAGATGGAA GTGCGTGAAC TTCTGTCCCA GTACGACTTC CCGGGCGACG 200 

ACACCCCGAT CATTCGTGGT TCTGCGCTGA AAGCGCTGGA AGGCGAAGCT 250 

GAGTGGGAAG AGAAAATCGT TGAGCTGGCT GGCTACCTGG ATTCCTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTC CTGCTG CCGAT CGAAG 3 50 

25 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCA TCAARGTKGG TGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

TACTGCGAAA TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

AC GAAGGTCG TGCAGGCGAG AACTGCGGCG TTCTGCTGCG TGGTATCAAG 55 0 

CGTGAAGAGA TCCAGCGTGG CCAGGTTCTG GCTAAGCCGG GCTCAATCAA 600 

30 GCCGCACACC AAGTTCGAAT CCGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CAC CATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCATCCGA 8 00 

TCGCGATGGA CGACGG 816 

35 



. 2 ) INFORMATION FOR SEQ ID NO : 82 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eubacterium 1 en turn 
50 (B) STRAIN: ATCC 43055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 

CGGCGCCTCC TCGTTATCGC CGCCACCGAC GGCCCGATGG CCCAGACCCG 50 

55 CGAGCACATC CTGCTCGCCC GTCAGGTCGG CGTGCCCTAC ATCGTGGTCT 100 

TCCTGAACAA GTGCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAAG TTCGCGAGCT GCTCGACTCT TACGAGTTCC CGGGCGACGA 200 

CACCCCGATC ATCCGCGGCT CCGCTTTGAA GGCCCTCGAG GGCGACAAAG 250 

AGTGGCAGGA GAAGGTCTGG GAGCTCATGG ACGCCGTCGA CTCCTACATC 300 

60 CCGACGCCGG AGCGCATGGT CGACAAGCCG TTCCTGATGG CCGTCGAGGA 350 

50 
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CACGATGACC ATCACCGGCC GCGGCACCGT 
GTGGTACGCT GCATGTCAAC GACCCGCTGG 
ACCCAGAACA CGGTCTGCAC CGGTATCGAG 
CGAGGCTCAG GCCGGCGACA ACATCGGCTG 
5 GCGAGGAGAT CGTTCGCGGC CAGGTTCTCT 
CCGCACACCG AGTTCGAGGG TCAGGTCTAC 
CGGCCGCCAC ACGCCGTTCT TCGACGGCTA 
GCACGACGGA CGTGACGGGT GTTGCCCACC 
GTCATGCCGG GCGACAACGT GGAGATCAAG 
10 CGCCAGGAAG AGGGCTGCGT TCGCTAAC 



TGCCACCGGT CGTGTGGAGC 4 00 

AGATCGTCGG TATCAAGGAG 450 

ATGTTCCGCA AGCTGCTCGA 500 

CCTGCTCCGC GGTGTCAAGC 550 

GCAAGCCCGG TAGCGTGACC 600 

ATCC TGACGA AGGAAGAGGG 650 

CCGTCCGCAG TTCTACTTCC 700 

TTCCCGAGGG CACCGAGATG 750 

GGCGAGCTCA TTCACCCGAT 8 00 

828 



15 



2 ) INFORMATION FOR SEQ ID NO: 83 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Eubacterium nodatum 

(B) STRAIN: ATCC 33099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

30 GGAGCAATTC TGGTTTGTGC AGCAACKGAC GGACCAATGC CTCAGACAAG 50 

AGAACATATC CTTTTGTCAA GGCAGGTAGG AGTGCCATAT ATCATCGTAT 100 

TCCTGAATAA ATGTGACATG GTGGATGAYG AAGAGCTTCT GGACTTGGTA 150 

GAGATGGAAG TAAGAGAACT TCTCAGTGAG TATGAATTCC CGGGAGATGA 200 

TACCCCGATA GTAAGAGGTT CAGCCCTGAA GGCACTGGAA GAACC CAATG 250 

3 5 GAGAATGGGC AGACAAGATT GTAGAGCTGA TGGAGGAAGT AGATAAATAC 3 00 

ATTCCTGAAC CAAAGAGAGA TAACGACAAA CCGTTCCTGA TGCCTGTAGA 3 50 

GGACGTATTC TCAATAACAG GAAGAGGAAC AGTAGCGACA GGAAGRGTTG 4 00 

AAAGAGGAAT CCTGAAGGTC GGTGATGAAG TAGAAATCGT GGGAATGAGC 450 

GAAGAGAGAA GAAAGGTAGT AGTAACGGGA GTTGAAATGT TCAGAAAGCT 500 

40 TCTGGATGAA GCAGAGACAG GAGACAACAT CGGAGCACTG CTGAGAGGAG 550 

TTCAGAGAAC RGAGATCCAG AGAGGTCAGG TATTGGCRGC ACCTGGAACG 600 

ATCAACCCAC iVTACAAAcY^ 6b0~~ 

AGAAGGAGGA AGGCATACGC CGTTCTTCAA YGGATACAGW CCACAGTTCT 700 

ACTTCAGAAC AACAGACGTA ACAGGAGATT TGCAGCTGCC GGAAGGARCA 750 

45 GAGATGTGCA TGCCGGGAGA TAATGTGGTA ATGAACRTCA GCCTGATCAC 800 

TCCGATTGCT ATAGAAGAGG GWCTGAGATT TGCCA 835 



50 2 ) INFORMATION FOR SEQ ID NO: 84 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

51 
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(A) ORGANISM: Ewingella americana 

(B) STRAIN: ATCC 33852 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 



GCGATCCTGG 
GCACATCCTG 
TGAACAAATG 
ATGGAAGTTC 

10 CCCAGTTGTT 
GGGAAGCTAA 
GAACCAGAGC 
ATTCTCCATC 
GTATCATCAA 

15 GTTAAGTCTA 
AGGCCGTGCY 
AAGACATCGA 
CACACCACWT 
CCGTCATACT 

2 0 CAACTGACGT 
ATGCCAGGTG 
GATGGATGAC 



TTGTTGCTGC 
TTGGGTCGYC 
CGACATGGTT 
GYGAACTTCT 
AAAGGTTCAG 
GATCATCGAG 
GTGCGATTGA 
TCCGGTCGTG 
AGTTGGCGAA 
CTTGTACTGG 
GGTGAGAACG 
ACGTGGTCAG 
TYGACTCAGA 
CCGTTCTTCA 
GACCGGTACC 
ACAACATCAA 
GGTCTGCGTT 



AACTGATGGC 
AGGTTGGCGT 
GATGACGAAG 
GTCTGCTTAC 
CGCTGAAAGC 
CTGGCTGGCC 
CAAGC CATTC 
GTACCGTTGT 
GAAGTTGAAA 
CGTTGAAATG 
TTGGTGTTCT 
GTTCTGGCTA 
AGTTTATATC 
AAGGCTACCG 
ATC GAACTGC 
CATGRTAGTT 
TCGCAA 



CCTATGCCAC 
TCCATTCATG 
AGCTGCTGGA 
GATTTCCCAG 
ACTGGAAGGC 
ACCTGGATAG 
CTGCTGCCAA 
TACCGGTCGT 
TCGTTGGTAT 
TTCCGCAAAC 
GCTGCGTGGT 
AACCAGGTTC 
CTGAGCAAAG 
TCCACAGTTC 
CAGAAGGCGT 
ACCCTGATCC 



AGACTCGTGA 50 

ATCGTATTCA 100 

ACTGGTAGAA 150 

GCGACGACAT 200 

GAAGCAGAGT 250 

CTACATCCCA 3 00 

TCGAAGACGT 350 

GTAGAGCGCG 400 

CAAGGACACT 450 

TGC TGGACGA 500 

ATCAAGCGTG 550 

AATCAAACCA 600 

ATGAAGGCGG 650 

TACTTCCGTA 700 

AGAGATGGTA 750 

ACCCAATCGC 800 

826 



25 



35 



2) INFORMATION FOR SEQ ID NO: 85 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 8 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 

(B) STRAIN: LVS 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 



TGGTG CTATT CTAGTATGTT CTGCTGCGGA TGGTCCTATG CCACAAACTC 50~~ 

GTGAGCACAT TCTGCTTTCT CGTCAAGTTG GTGTACCAAA AATCGTTGTT 100 

TTCTTAAACA AGTGTGACAT GGTTGATGAT GAAGAGTTAT TAGAGCTAGT 150 

45 TGAGATGGAA GTTCGTGAGC TTTTAGATCA GTATGAGTTC CCAGGTGATG 2 00 

ACACTCCAGT TATTATGGGT TCAGCTCTTA GAGCTATTGA AGGTGACGAA 250 

GCTTACGTTG AGAAAATTGT TGAGCTAGTT CAAGCTATGG ATGACTATAT 3 00 

TCCTGCTCCT GAGCGTGATA CTGAGAAGCC ATTTATTCTT CCGATCGAAG 350 

ATGTATTCTC AATTTCAGGT CGTGGTACTG TTGTAACTGG TCGTATTGAG 4 00 

50 CGCGGTGTAG TTAACGTTGG TGATGAAGTT GAAGTTGTTG GTATTCGTCC 450 

AACTCAAAAA ACTACAGTAA CTGGTGTGGA AATGTTCCGT AAGCTTTTAG 500 

ATAGAGGGGA AGCTGGTGAT AACGTTGGTA TCCTAGTTCG TGGAC TTAAG 550 

AGAGATGATG TTGAGCGTGG ACAAGTATTA TGTAAGCCAG GTTCAATTAA 600 

GCCACATACT. AAGTTTGAAG CTGAGGTTTA TGTATTATCT AAAGAAGAGG 650 

55 GTGGTAGACA TACTCCATTC TTCAAGGGAT ATAGACCACA ATTCTACTTC 700 

CGTACTACAG ACATTACTGG AGCTGTTGAG CTTC CAGAGG GTGTAGAAAT 750 

GGTTATGCCT GGTGATAACG TTAAGATGAC TATCACTCTA ATTAACCCAA 800 

TCGCTAGGAT GAAGGGTTAC GTTTTGCA 828 

60 

52 
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2 ) INFORMATION FOR SEQ ID NO: 86 



10 



15 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphum 

(B) STRAIN: ATCC 10953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 



20 



25 



30 



35 



CGGTGCTATC 
GTGAGCAYAT 
TTCATGAACA 
TGAAATGGAA 
ACACTCCKTT 
AAGTGGGAAG 
TCCTTTGCCT 
ACGTATTCTC 
GCTGGTGTTA 
AGACAAGAAG 
ATCAAGGTGA 
AAGAACGAAA 
ACCTCACTCT 
GTGGTCGTCA 
CGTACTATGG 
GGTAATGCCT 
TAGCATTGAA 



ATCGTAGKTG 
CYTGCTGGCT 
AGTGYGACAT 
ATGCGTGAAC 
CATTCAGGGT 
AGAAGGTTAT 
CCGCGTGATA 
AATCACTGGT 
TCCATGTAGG 
TCTGTTGTAA 
AGCTGGTGAC 
TCAAACGTGG 
AAGTTCAAAG 
CACTCCGTTC 
ACTGTACAGG 
GGTGATAACG 
CGTAGGTTTG 



CTGCTACTGA 
CGTCAGGTAA 
GGTAGACGAC 
TGCTTTCAGC 
TCTGCTCTTG 
GGANCTGATG 
TTGAYAAACC 
CGTGGTACTG 
TGACGAAGTT 
CTGGTGTTGA 
AACGTAGGTY 
TATGGTTCTT 
CTTCTATCTA 
CACAACAAAT 
TGAAATCWCT 
TAGAAATCAC 
CGTTTCGCT 



TGGTCCGATG 
ACGTWCCKAG 
GCTGAAATGY 
YTACGAATTC 
GTGCRTTGAA 
GATGCTTGCG 
GTTCTTGATG 
TAGCTACTGG 
GAAATCCTCG 
AATGTTCCGC 
TGTTGCTCCG 
TGTAAGCCCG 
CGTTTTGAAG 
ACCGTCCTCA 
CTTCCGGAAG 
TGTAGAACTG 



CCTCARACTC 50 

ACTGGTTGTA 100 

TGGAACTCGT 150 

GAYGGYGACA 200 

YGGCGTTGAA 250 

ACACTTGGAT 300 

CCGGTTGAAG 350 

TCGTATCGAA 400 

GTTTGGGTGA 450 

AAGTTGCTGG 500 

TGGTATC GAC 550 

GTCAGATTAA 600 

AAAGAAGAAG 650 

GTTCTATCTG 700 

GAACTGAAAT 750 

ATCTACCCGG 800 
829 



2) INFORMATION FOR SEQ ID NO : 87 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid " 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE : Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella haemolysans 
50 (B) STRAIN: ATCC 103 79 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

CTATCTTAGT AATCGCTGCT ACAGATGGAC CAATGGCTCA AACTCGTGAG 50 

55 CACATCCTAT TATCTCGTAA CGTTGGAGTA CCAAAAATCG TTGTATTCTT 100 

AAACAAATGT GATATGGTTG ATGACGAAGA GTTATTAGAA TTAGTTGAAA 150 

TGGAAGTTCG TGAACTATTA TCTGAATACG GATTCGACGG AGATGAACTA 200 

CCAGTAATCA AAGGTTCTGC TCTTAAAGCT CTTGAAGGAG ATGCAGATGC 250 

AGAAAAAGCT ATCATCGAAT TAATGGAAAC AGTTGACGAA TACATCCCAA 300 

60 CTCCAGAACG TGATAACGCT AAACCATTCA TGATGCCAGT TGAGGACGTA 350 

53 
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TTCTCAATCA CAGGTCGTGG TACAGTTGCT 
ACAAGTTAAA GTTGGAGACG TAGTAGAAAT 
CAGCTTCAAC .TACTGTAACA GGTGTTGAAA 
TACGCTGAAG CAGGAGATAA CATCGGTGCA 
5 TGAAGACATC GAACGTGGAC AAGTTTTAGC 
CACACACTCA ATTCGTAGCT GACGTGTACG 
GGACGTCACA CTCCATTCTT CACAAACTAC 
TACTACTGAC GTAACTGGTG TAGTTACTTT 
TAATGCCTGG GGATAACGTA TCAATCAACG 
10 GCGATCGAAG AAGGAACTCG TTTCTCAA 



ACTGGACGTG TTGAACGTGG 400 

CGTTGGATTA ACTGAAGAAC 450 

TGTTCCGTAA ATTATTAGAT 500 

TTATTACGTG GTGTTG CTCG 550 

AGCTCCTAAA ACAATCACTC 600 

TATTATCTAA AGAAGAAGGT 650 

CGTCCTCAAT TCTACTTCCG 700 

ACCAGAAGGT ACTGAAATGG 750 

TAGAACTTAT TTCTCCAATC 800 

828 



15 



2) INFORMATION FOR SEQ ID NO: 88 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 2 7824 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 

3 0 TCTTAGTAAT CGCTGCTACA GATGGTCCTA TGGCTCAAAC TCGTGAACAC 50 

ATCCTATTAT CTCGTAACGT TGGAGTACCT AAAATTGTTG TATTCTTAAA 100 

CAAATGTGAT ATGGTTGATG ACGAAGAGTT ATTAGAATTA GTAGAAATGG 150 

AAGTTCGTGA ACTATTATCT GAATACGGAT TTGATGGAGA TGAACTACCA 200 

GTAATCAAAG GTTCAGCTCT TAAAGCTCTT GAAGGAGATG CAGATGCTGA 250 

3 5 AAAAGCTATC ATCGAATTAA TGGAAACAGT TGACGAGTAC ATCCCAACTC 300 

CAGAACGTGA TAACGCTAAA CCATTTATGA TGCCAGTTGA GGACGTGTTC 350 

TCAATCACAG GTCGTGGTAC AGTTGCTACT GGACGTGTTG AACGTGGACA 400 

AGTTAAAGTT GGTGACGTAG TAGAAATCGT TGGATTAACT GAAGAACCAG 450 

CTTCAACTAC TGTAACAGGT GTTGAAATGT TCCGTAAATT ATTAGATTAC 500 

4 0 GCTGAAGCAG GAGATAACAT CGGTGCATTA TTACGTGGTG TTGCTCGTGA 550 

AGATATCGAA CGTGGACAAG TTTTAGCAGC TCCTAAAACA ATCACTCCAC 600 

ATACTCAATT CGTAGCTGAT GTGTACGTAT TATCTAAAGA AGAAGGTGGA 650 

CGTCACACTC CATTCTTCAC AAACTACCGT CCACAATTCT ACTTCCGTAC 700 

TACTGACGTA ACTGGTGTAG TTACTTTACC AGAAGGTACT GAAATGGTAA 750 

45 TGCCTGGGGA CAACGTATCA ATCAACGTAG AACTTATTTC TCCAATCGCT 8 00 

ATCGAAGAAG GAACTCGTTT CTC 823 



50 2) INFORMATION FOR SEQ ID NO: 89 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

54 
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(A) ORGANISM: Haemophilus actinomycetemcomitans 

(B) STRAIN: ATCC 3 33 84 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 

5 

GCTATCTTAG TAGTAGCAGC AACCGACGGT CCTATGCCAC AAACTCGTGA 50 

GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150 

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTCCCGG GCGATGACAC 200 

10 CCCAATCGTA CGCGGTTCTG CATTAAAAGC GCTTGAAGGC GATGCCGCAT 250 

GGGAAGAAAA AATCCTTGAA TTAGCAAACC ATTTAGATAC TTACATCCCG 3 00 

GAACCTGAGC GTGCTATCGA CCAACCGTTC CTTCTTCCAA TTGAAGATGT 3 50 

GTTCTCTATC TCCGGTCGTG GTACCGTAGT AACGGGTCGT GTTGAGCGCG 4 00 

GTATCATCCG TACCGGTGAT GAAGTTGAAA TCGTGGGTAT CAAACCGACT 450 

15 GCAAAAACCA CCGTAACCGG TGTTGAAATG TTCCGTAAAT TACTTGACGA 500 

AGGTCGTGCG GGTGAAAACA TCGGTGCATT ATTGCGTGGT ACTAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTATTGGCGA AACCGGGGTC AATCACCCCG 600 

CACACTGACT TCGAATCTGA AGTGTACGTA TTGTCCAAAG AAGAAGGTGG 65 0 

TCGTCATACT CCATTCTTCA AAGGTTACCG TCCACAATTC TATTTCCGTA 700 

20 CAACTGACGT AACCGGTACT ATCGAGTTAC CTGAAGGCGT GGAAATGGTT 750 

ATGCCTGGCG ATAACATCAA AATGACCGTA TCCTTAATTC ACCCAATTGC 800 

GATGGACCAA GGTTTACGTT TCGCTATCG 829 



25 



2) INFORMATION FOR SEQ ID NO: 90 



( i ) SEQUENCE CHARACTER I STI CS : 
(A) LENGTH: 833 bases 
3 0 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus aphrophilus 

(B) STRAIN: ATCC 33389 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 



TGGTGCTATC TTAGTAGTAG~ CAGCAACTGA TGGTCCTATG CCACA^CTC 5o" 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

45 TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGATG 200 

ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTTGCA 250 

GAATGGGAAG AAAAAATCCT TGAATTAGCA AACCACTTAG ATACTTACAT 300 

TCCTGAGCCA CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATTGAAG 350 

ACGTGTTCTC TATCTCCGGT CGTGGTACTG TAGTAACAGG TCGTGTTGAG 400 

50 CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 450 

GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAA AACAT CGGTG CATTATTACG TGGCACTAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GCTCAATCAC 600 

TCCGCACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 6 50 

55 GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700 

CGTACAACTG ACGTAACCGG TACTATCGAG TTACCGGAAG GCGTGGAAAT 7 50 

GGTTATGCCT GGCGATAACA TCAAAATGAC TGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTTTCGCTA TCG 833 

60 

55 
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2 ) INFORMATION FOR SEQ ID NO: 91 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 



CGGCGCTATC TTAGTTGTAG CAGCAACTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTATTAGGC CGCCAAGTTG GTGTTCCTTA CATCATCGTA 100 

2 0 TTCTTAAATA AATGCGATAT GGTAGATGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGATTTC CCAGGTGACG 2 00 

ATACTCCTAT CGTTCGTGGT TCAGCATTAC AAGCATTAAA TGGTGTGCCT 250 

GAGTGGGAAG AAAAAATCAT TGAATTAGCA CAACACTTAG ATTCTTATAT 3 00 

CCCTGAGCCT GAGCGTGCGA TTGATAAACC TTTCTTATTA CCAATCGAAG 350 

25 ACGTATTCTC AATTTCAGGT CGTGGTACAG TAGTAACCGG TCGTGTTGAG 400 

CGTGGTATCA TCAAATCAGG TGAAGAAGTT GAAATCGTAG GGATTAAAGA 450 

AACGACAAAA ACAACAGTAA CCGGTGTTGA GATGTTCCGT AAACTATTAG 500 

ACGAAGGTCG TGCGGGTGAA AACGTAGGTG CCTTATTACG TGGTAC TAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTA GCGAAACCAG GTACAATTAC 600 

3 0. ACCACACACT GATTTTGAAT CAGAAGTTTA TGTATTATCA AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ATCGTCCTCA GTTCTACTTC 700 

CGYACAACGG ACGTAACAGG AACGATTGAA TTACCTGAAG ATGTTGAGAT 750 

GGTAATGCCT GGTGATAATA TCAAGATGAC AGTAAGCTTA ATTCACCCTA 800 

TCGCGATGGA CGAAG 815 

35 



2) INFORMATION FOR SEQ ID NO: 92 

40 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucle'ic acid " 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

'( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 
50 (B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCAATG CCACAAACTC 50 

55 GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCATA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCAGGTGACG 2 00 

ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCATTAAA TGGCGTAGCA 2 50 

GAATGGGAAG AAAAAATCCT TGAGTTAGCA AACCACTTAG ATACTTACAT 300 

60 CCCAGAACCA GAGCGTGCAA TTGACCAACC GTTCCTTCTT CCAATCGAAG 3 50 

56 
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ATGTGTTCTC AATCTCAGGT 
CGTGGTATCA TCCGTACTGG 
TACAGCAAAA ACTACTGTAA 
ACGAAGGTCG TGCAGGTGAA 
5 CGTGAAGAAA TCGAACGTGG 
GCCACACACT GACTTCGAAT 
GTGGTCGTCA TACTCCATTC 
CGTACAACTG ACGTAACTGG 
GGTAATGCCA GGCGATAACA 
10 TCGCGATGGA CCAAGGTTTA 



CGTGGTACAG TAGTAACTGG 
TGATGAAGTA GAAATCGTAG 
CGGGTGTTGA AATGTTCCGT 
AACATCGGTG CATTATTACG 
TCAAGTATTA GCGAAACCAG 
CAGAAGTTTA TGTATTATCA 
TTCAAAGGTT ACCGTCCACA 
TACTATCGAG TTACCAGAAG 
TCAAGATGAC AGTAAGCTTA 
CGTTTCGCAA 



TCGTGTAGAA 40 0 

GTATCAAAGA 450 

AAATTACTTG 500 

TGGTACCAAA 550 

GTTCAATCAC 60 0 

AAAGAAGAAG 65 0 

ATTCTATTTC 7 0 0 

GCGTAGAAAT 75 0 

ATCCACCCAA 80 0 

830 
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2) INFORMATION FOR SEQ ID NO: 93 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Haemophilus parahaemolyticus 

(B) STRAIN: ATCC 10014 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 

30 TCTTAGTAGT AGCAGCAACA GACGGTCCAA TGCCACAAAC TCGTGAGCAC 50 
ATCTTATTAG GTCGCCAAGT AGGTGTTCCA TACAT CATCG TATTCTTAAA 100 
CAAATGCGAT ATGGTTGACG ATGAAGAATT ATTAGAATTA GTTGAAATGG 150 
AAGTGCGTGA ACTTCTTTCA CAATATGACT TCCCAGGTGA TGACACGCCA 2 00 

GTAGTACGTG GTTCAGCGTT ACAAGCGTTA AACGGCGTAG CAGAGTGGGA 250 
35 AGAAAAAATT CTTGAATTAG CAAACCACTT AGATACATAC ATCCCAGAGC 3 00 

CAGAGCGTGC GATTGATAAA CCATTCTTAT TACCAATCGA AGACGTATTC 3 50 

TCAATCTCAG GTCGTGGTAC AGTAGTAACA GGTCGTGTTG AGCGTGGTAT 400 
CATCAAAGCG GGTGAAGAAG TTGAAATCGT AGGTAT CAAA GACACTGCGA 450 
AAACAACAGT AACTGGCGTG GAAATGTTCC GTAAATTATT AGACGAAGGT 50 0 

40 CGTGCGGGTG AAAACGTTGG TGCATTATTA CGTGGTACAA AACGTGAAGA 550 
AATCGAACGT GGTCAAGTGT TAGCGAAACC AGGTACAATT ACACCACACA 600 

CAGACTTCGA ATCAGAAGTG TACGTATTAT CAAAAGAAGA AGGTGGT CGT 650 

CACACTCCAT TCTTCAAAGG TTACCGTCCA CAATTCTACT TCCGTACAAC 700 

TGACGTAACT GGTACTATTG AATTACCAGA AGGCGTAGAA ATGGTAATGC 750 

45 CAGGCGATAA CATCAAAATG ACAGTATCAT TAATCCACCC AATCGCGATG 80 0 

GACGAAGGTT TACGTTTTGC GATT 824 



50 2 ) INFORMATION FOR SEQ ID NO: 94 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

57 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(A) ORGANISM: Haemophilus parainfluenzae 

(B) STRAIN: ATCC 7901 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 

5 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTG 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGACG 2 00 

10 ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTWGCA 2 50 

GAATGGGAAG AAAAAATCCT TGAATTAGCT AGCCACTTAG ATTCTTACAT 3 00 

TCCTGAGCCT CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATCGAAG 350 

ACGTGTTCTC TATCTCCGGT CGTGGTACAG TAGTAACAGG TCGTGTTGAG 4 00 

CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 4 50 

15 GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTACYAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GTTCAATCAC 600 

TCCACACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700 

20 CGTACAACTG ACGTAACCGG AACTATCGAA TTAC CGGAAG GCGTGGAAAT 7 50 

GGTTATGCCT GGTGATAACA TCAAAATGAC TGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTTTCGCTA TCG 833 



25 



35 



2) INFORMATION FOR SEQ ID NO: 95 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 824 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus paraphrophilus 

(B) STRAIN: ATCC 2 9241 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG CCACAAACTcT " S0~ 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

45 TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGACG 2 00 

ATACGCCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTTGCA 250 

GAATGGGAAG AAAAAATCCT TGAATTAGCA AACCACTTGG ATACTTACAT 3 00 

TCCTGAGCCA CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATCGAAG 350 

ACGTGTTCTC TATCTCCGGT CGTGGTACAG TAGTAACAGG TCGTGTTGAG 4 00 

50 CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 450 

GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTAC CAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GTTCAATCAC 600 

TCCACACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 650 

55 GTGGTCGTCA TACTCCATTC . TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700 

CGTACAACTG ACGTAACCGG TACTATCGAG TTACCGGAAG GTGTGGAAAT 750 

GGTAATGCCT GGCGATAACA TCAAAATGAC CGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTT 824 



60 
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2) INFORMATION FOR SEQ ID NO: 96 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus segnis 

(B) STRAIN: ATCC 33393 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 



GCTATCTTAG - TAGTAGCAGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

2 0 TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150 

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTC CCAG GTGATGATAC 2 00 

TCCAATCATT CGTGGTTCTG CATTACAAGC GTTAAACGGC GTAGCAGAAT 250 

GGGAAGAAAA AATCCTTGAA TTAGCTCAAG CATTAGATAC TTACATTCCT 3 00 

GAACCTGAGC GTGCAATCGA CCAACCGTTC CTTCTTCCAA TTGAAGACGT 3 50 

25 GTTCTCAATC TCTGGTCGTG GTACTGTAGT AACAGGTCGT GTAGAGCGTG 4 00 

GTATCATCCG TACCGGTGAT GAAGTTGAAA TCGTTGGTAT CAAACCAACT 450 

GCGAAAACAA CCGTAACCGG TGTTGAAATG TTCCGTAAAT TACTTGACGA 500 

AGGTCGTGCA GGTGAAAACA TCGGTGCATT ATTACGTGGT ACTAAACGTG 550 

AAGAAATCGA ACGTGGTCAA GTATTAGCGA AACCGGGTTC AATCACTCCA 600 

30 CACACTGACT TCGAATCTGA AGTGTACGTA TTATCTAAAG AAGAAGGTGG 650 

TCGTCATACT CCATTCTTCA AAGGTTACCG TCCACAATTC TATTTCCGTA 7 00 

CAACTGACGT AACCGGTACT ATCGAGTTAC CGGAAGGCGT GGAAATGGTT 750 

ATGCCTGGCG ATAACATCAA AATGACCGTA TCCTTAATCC ACCCAATCGC 800 

GATGGACCAA GGTTTACG 818 

35 



2) INFORMATION FOR SEQ ID NO : 97 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 bases 

(B) TYPE: Nucleic acid" 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 
50 (B) STRAIN : ATCC 13337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 

CGGCGCTATC CTGGTTGTTG CTGCGACTGA CGGCCCTATG CCTCAGACTC 50 

55 GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTW CAT CATC GTR 100 

TTCCTGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCWGGYGATG 2 00 

ATACTCCAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGYGMACCT 250 

GAGTGGGAAG CTAAGATCGT AGAACTGGCT GAAACTCTGG ATTCTTACAT 3 00 

60 YCCACARCCA GAACGTGCTA TCGAYAAGCC ATTCCTGCTG CCAAT CGAAG 3 50 

59 
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ACGTATTCTC TATCTCTGGC CGTGGTACWG 
CGCGGTATCG TTAAAGTTGG TGAAGAAGTT 
TACCGTTAAA TCAACTTGTA CCGGCGTTGA 
ACGAAGGTCG TGCAGGCGAG AACGTTGGTG 
5 CGTGAAGACA TCGAACGTGG TCAGGTTCTG 
GCCACACACC AAGTTCGAAT CAGAAGTTTA 
GCGGYCGTCA TACTCCGTTC TTCAAAGGCT 
CGTACAACTG ACGTGACCGG TACCATCGAA 
GGTAATGCCA GGC 



TTGTTACCGG TCGTGTAGAG 4 00 

GARATCGTTG GTATCAAAGA 450 

AATGTTCCGT AAACTGCTGG 500 

TTCTGCTGCG TGGTATCAAG 550 

GCTAAACCAG GYTCYATCAA 600 

TATTCTGAGC AAAGATGAAG 650 

ACCGTCCACA GTTCTACTTC 700 

TTGCCAGAAG GCGTGGAAAT 750 

763 



2) INFORMATION FOR SEQ ID NO: 98 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella Jcingae 
25 (B) STRAIN: ATCC 23330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 

CGGCGCAATC TTGGTATGTT CAGCAGCTGA CGGTCCTATG CCACAAACTC 50 

3 0 GCGAACACAT CTTGTTGGCT CGCCAAGTAG GTGTACCTTA TATCATCGTA 10 0 

TTCATGAACA AATGCGACAT GGTCGATGAT GCTGAGTTGT TGGAATTGGT 150 

TGAAATGGAA ATCCGTGACT TGTTGTCTAG CTACGATTTT CCAGGCGACG 200 

ATTGCCCAAT CGTTCAAGGT TCTGCATTGC GYGCATTGGA AGGCGACGCT 250 

GCATACAAAG AAAAAATCTT TGAATTGGCT GCTGCTTTGG ATAGCTACAT 30 0 

35 TCCTACTCCA GAACGTGCTG TTGATAAACC ATTCTTGTTG CCAATCGAAG 350 

ATGTATTCTC TATCTCTGGT CGTGGTACAG TAGTTACTGG TCGTGTAGAG 40 0 

CGCGGTATCA TCAAAGTAGG CGAAGAGATT GAAATCGTTG GTTTGAAAGA 450 

CACGCAAAAA ACCACTTGTA CTGGCGTGGA AATGTTCCGC AAATTGTTGG 500 

ACGAAGGTCA AGCTGGTGAT AACGTTGGTG TATTATTGCG TGGTACGAAG 550 

40 CGTGAAGACG TTGAACGTGG TCAGGTATTG GCTAAACCAG GTTCTATCAC 60 0 

TCCGCACACT AAATTTGAAG CTGAAGTGTA TGTGTTGAGC AAAGAAGAAG 650 

GTGGCCGTCA ~TACGO^Ttc~ /o"u " 

CGTACGACTG ACGTAACTGG TGCAGTTACT TTGTCTGAGG GTGTGGAAAT 750 

GGTTATGCCA GGCGAAAACG TGAAAATCAC TGTTGAGTTG ATTGCACCTA 800 

45 TCGCTAGGAA AACGGTTTGC GTTTTGCG 82 8 



50 



2 ) INFORMATION FOR SEQ ID NO : 99 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

{ i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Klebsiella oirnithinolytica 

60 
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(B) STRAIN : ATCC 318 98 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 

5 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTT CCGTAC AT C ATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAGCTGGTT 150 

GAAATGGAAG TCCGTGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACGCCGATC GTTCGTGGTT CCGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250 

10 ACTGGGAAGC GAAAATCATC GAACTGGCTG GCTACC TGGA TTCTTACATC 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTC CGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 4 50 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

15 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGG CTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

2 0 GTCATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TTCACCCGAT 8 00 

CGCGATGGAC GATGGTCTGC GTTTCGCA 828 



25 2) INFORMATION FOR SEQ ID NO: 100 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 74 9 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klehsiella axytoca 

(B) STRAIN: ATCC 3 3496 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 



GATGCCGCAG ACTCGTGAGC ACATCCTGCT GGGTCGTCAG GTAGGCGTTC * 50 

CGTACATCAT CGTGTTCCTG I^CAAGTGCG" \6^G(yff^~i:^C(^^GA6 " loV ~ 

CTGCTGGAAC TGGTTGAAAT GGAAGTTCGT GAACTTCTGT CTCAGTACGA 150 

TTTCCCGGGC GACGACACTC CGATCGTTCG TGGTTCTGCT CTGAAAGCGC 2 00 

45 TGGAAGGCGA CGCWGAGTGG GAAKCKAAAA TCATCGAACT GGCTGGCTTC 2 50 

CTGGATTCTT ACATTCCGGA ACCAGAGCGT GCGATTGACA AGCCGTTCCT 3 00 

GCTGCCGATC GAAGACGTAT TCTCCATCTC CGGTCGTGGT ACCGTTGTTA 350 

CCGGTCGTGT AGAGCGCGGT ATCATCAAAG TTGGCGAAGA AGTTGAAATY 4 00 

GTTGGTATYA AAGACACTGC TAAGTCTACC TGTACTGGCG TTGAAATGTT 4 50 

50 CCGCAAACTG CTGGACGAAG GCCGYGCTGG TGAGAACGTT GGTGTTCTGC 500 

TGCGTGGTAT CAAACGTGAA GAAATCGAAC GTGGTCAGGT ACTGGCTAAG 550 

CCGGGCTCTA TCAAGCCGCA CAC CAAGTTC GAAT CTGAAG TTTATATCCT 600 

GTCCAAAGAC GAAGGCGGCC GTCACACTCC GTTCTTCAAA GGCTACCGTC 650 

CGCAGTTCTA CTTCCGTACA ACTGACGTGA CTGGCACCAT CGAACTGCCG 700 

55 GAAGGCGTAG AGATGGTTAT GCCGGGCGAC AACATCAAAA TGGTTGTTA 749 



60 



2) INFORMATION FOR SEQ ID NO: 101 

61 
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. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 3 3531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 

15 TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAGC TGCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCCGCTCTGA AAGCGCTGGA AGGCGAAGCA 250 

2 0 GACTGGGAAG CGAAAATCAT CGAACTGGCT GGCTACCTGG ATTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCTATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

TACTG CTAAR TCWACCTGTA CYGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

25 ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCTCTATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGACGAAG 65 0 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

3 0 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATTCACCCRA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 83 0 



35 2) INFORMATION FOR SEQ ID NO: 102 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

4 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella, pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 112 96 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAGATGGA 150 

AGTTCGTGAA CTGCTGTCTC AGTACGATTT CCCGGGCGAC GACACCCCGA 200 

55 TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCCACCTG GATACCTATA TCCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

ATCAAAGTAG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AAACCGCGAA 450 

60 AACCACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

62 
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GTGCTGGTGA 
ATCGAACGTG 
CAAGTTCGAA 
ATACTCCGTT 
5 GACGTGACTG 
GGGCGACAAC 
ACGACG 



GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GCACCATCGA 
ATCAAAATGG 



GTTCTGCTGC 
GGCTAAGCCG 
ACATCCTGTC 
TACCGTCCGC 
ACTGCCGGAA 
TTGTTACCCT 



GTGGTATCAA 
GGCACCATCA 
CAAAGACGAA 
AGTTCTACTT 
GGCGTAGAGA 
GATCCACCCG 



ACGTGAAGAA 
ACCCGCACAC 
GGCGGCCGTC 
CCGTACTACT 
TGGTAATGCC 
ATCGCGATGG 



550 
600 
650 
700 
750 
800 
806 



10 



15 



20 



25 



2) INFORMATION FOR SEQ ID NO: 103 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 743 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN; ATCC 13883 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 



GCAGACTCGT 
TCATCGTGTT 
GAACTGGTTG 

30 GGGCGACGAC 
GCGACGCAGA 
ACCTATATCC 
GATCGAAGAC 
GTGTAGAGCG 

35 ATCAAAGAAA 
ACTGCTGGAC 
GTATCAAACG 
ACCATCAACC 
AGACGAAGGC 

40 TCTACTTCCG 
GTAGAGATGG 



GAGCA CATCC 
CCTGAACAAA 
AGATGGAAGT 
ACTCCGATCG 
GTGGGAAGCG 
CGGAACCAGA 
GTATTCTCCA 
CGGTATCATC 
CCGCGAAAAC 
GAAGGCCGTG 
TGAAGAAATC 
CGCACACCAA 
GGCCGTCACA 
TACTACTGAC 
TAATGCCGGG 



TGCTGGGTCG 
TGCGACATGG 
TCGTGAACTG 
TTCGTGGTTC 
AAAATCATCG 
GCGTGCGATT 
TCTCCGGTCG 
AAAGTAGGTG 
CACCTGTACT 
CTGGTGAGAA 
GAACGTGGTC 
GTTCGAATCT 
CTCCGTTCTT 
GTGACTGGCA 
CGACAACATC 



TCAGGTAGGC 
TTGATGACGA 
CTGTCTCAGT 
TGCTCTGAAA 
AACTGGCTGG 
GACAAGCCGT 
TGGTACCGTT 
AAGAAGTTGA 
GGCGTTGAAA 
CGTAGGTGTT 
AGGTACTGGC 
GAAGTGTACA 
CAAAGGCTAC 
CCATCGAACT 
AAAATGGTTG 



GTTCCGTACA 50 

AGAGCTGCTG 10 0 

ACGATTTCCC 150 

GCGCTGGAAG 200 

CCACCTGGAT 250 

TCCTGCTGCC 3 00 

GTTACCGGTC 35 0 

AATCGTTGGT 4 0 0 

TGTTCCGCAA 450 

CTGCTGCGTG 500 

TAAGCCGGGC 55 0 

TCCTGTCCAA 60 0 

CGTCCGCAGT 650 

GCCGGAAGGC 700 

TTA 743 



45 2) INFORMATION FOR SEQ ID NO: 104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN: ATCC 13884 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 

63 
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TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAGATGGAAG 150 

5 TTCGTGAACT RCTGTCTCAG TACGATTTCC CGGGCGACGA CACCCCGATC 200 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 250 

GAAAATCATC GAACTGGCTG GCCACCTGGA TACCTATATC CCGGAACCAG 300 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC GCGGTATCAT 400 

10 CAAAGTAGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAA ACCGCGAAAA 450 

CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

GCTGGTGAGA ACGTAGG TGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG CGGCCGTCAC 650 

15 ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

GCGACAACAT CAAAATGGTT GTTACCCTGA TCCATGCGAT CGCGATGGAC 800 

GACGGTCTGC GTTTCGCAA 819 

20 

2 ) INFORMATION FOR SEQ ID NO: 105 



(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 ( i i ) MOLE CULE . TYPE : Genomi c DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera. ascorbata 

(B) STRAIN: ATCC 33433 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA TGGCCCTATG CCACAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

40 TTCCTGAACA AATGYGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 200 

ATACTCCAAT CATC C GT GGT T CTG^^fTGA~ AAGCGC T^^G^ ~AGGcZj^^ 7 (icA 2blf " 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

CCCAGAACCA GAACGTGCTA TCGATAAGCC GTTCCTGCTG CCAATCGAAG 350 

45 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTTGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

CACCGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCTATCAA 600 

50 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACTACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 750 

GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACTCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGCCTG CGTTT CGCAA CC 832 

55 

2 ) INFORMATION FOR SEQ ID NO : 106 
60 ( i ) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 
10 (B) STRAIN: ATCC 33435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACTC 50 

15 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGTGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 2 00 

ACACTCCTAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGACGCT 250 

GAGTGGGAAG CAAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

20 CCCAGAACCA GAGCGTGCGA TTGATAAGCC GTTCCTGCTG CCAATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATC GTGG GTATCAAAGA 4 50 

CACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 5 00 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

25 CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCCATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TAC CATCGAA CTGC CAGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCAA 800 

3 0 TCGCGATGGA CGACGGTCTG CGTTT CGCAA 830 



35 



2 ) INFORMATION FOR SEQ ID NO : 10 7 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TY r PE:~ Genomi c DNA~~ 

(Vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Kluyvera georgiana 

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

50 CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGCGACGACA 2 00 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCTGAG 250 

55 TGGGAAGCGA AAATCATCGA ACTGGCGGGC TTCCTGGATT CTTACATCCC 3 00 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCCGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAACGC 400 

GGTATCATCA AAGTTGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGACAC 450 

CGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

60 AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 

65 
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GAAGAAATCG AACGTGGTCA GGTACTGGCT 
GCACACCAAG TTCGAATCTG AAGTGTACAT 
GCCGTCATAC TCCGTTCTTC AAAGGCTACC 
ACTACTGACG TGACTGGCAC CATCGAACTG 
5 AATGCCGGGC GACAACATCA AAATGGTTGT 
CGAAGGACGA AGGTCTGCGT TTCGCA 



AAGCCGGGTT CTATCAAGCC 600 
TCTGTCCAAA GACGAAGGCG 650 
GTCCGCAGTT CTACTTCCGT 700 
CCGGAAGGCG TTGAGATGGT 750 
TACCCTGATC CACCCGATCG 800 

826 



10 2) INFORMATION FOR SEQ ID NO: 108 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 80 3 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus ca.se! subsp. casei 

(B) STRAIN: ATCC 393 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 



GCTGCTGATG GCCCAATGCC ACAAACTCGT GAACATATCT TACTTTCACG 50 

TCAAGTTGGT GTTCCATACA TCGTTGTATT CATGAACAAA TGTGACATGG 10 0 

TTGACGATGA AGAATTACTA GAATTAGTTG AAATGGAAAT TCGTGATCTA 150 

TTAACTGAAT ATGAATTCCC TGGCGATGAC ATTCCTGTAA TCAAAGGTTC 200 

30 AGCTCTTAAA GCACTTCAAG GTGAAGCTGA CTGGGAAGCT AAAATTGACG 250 

AGTTAATGGA AGCTGTAGAT TCTTACATTC CAACTCCAGA ACGTGATACT 3 00 

GACAAACCAT TCATGATGCC AGTTGAGGAT GTATTCTCAA TCACTGGTCG 350 

TGGAACAGTT GCAACTGGAC GTGTTGAACG TGGACAAGTT AAAGTTGGTG 400 

ACGAAGTAGA AGTTATCGGT ATTGAAGAAG AGAGCAAAAA AGTAGTAGTA 45 0 

35 ACTGGAGTAG AAATGTTCCG TAAATYACTA GATTACGCTG AAGCTGGCGA 500 

CAACATTGGC GCACTTCTAC GTGGTGTTGC TCGTGAAGAT ATCCAACGTG 550 

GTCAAGTATT AGCTAAACCA GGTTCGATTA CTCCACACAC TAACTTCAAA 600 

GCTGAAACTT ATGTTTTAAC TAAAGAAGAA GGTGGACGTC ACACTCCATT 650 

CTTCAACAAC TACCGCCCAC AATTCTATTT CCGTACTACT GACGTAACTG 70 0 

40 GTATTGTTAC ACTTCCAGAA GGTACTGAAA TGGTAATGCC TGGTGATAAC 750 

ATTGAGCTTG CAGTTGANCT AATTGCACCA ATCGCTATCG AAGACGGTAC 80 0 

TAA~ au Y~ 



45 



55 



2 ) INFORMATION FOR SEQ ID NO: 109 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 825 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis subsp. lactis 

(B) STRAIN: ATCC 19435 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 

66 
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CGGTGCAATC CTCGTTGTTG CTGCAACTGA TGGACCAATG CCACAAACTC 50 
GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC . 100 

TTCCTTAACA AGGCTGACCT TGTTGATGAT GAAGAATTGA TGGAACTCGT 150 

5 TGAAATGGAA GTTCGTGACC TCTTGAGCGA ATACGACTTC CCAGGTGACG 2 00 

ATATTCCTGT AATCGCTGGT TCAGC AC TTG GTGCTTTGAA CGGTGAACCA 250 

CAATGGGTTG CTAAAGTTGA AGAATTGATG GACATCGTTG ATGAATACAT 300 

CCCAACTCCA GAACGCGACA CTGACAAACC ACTCCTTCTT CCAGTCGAAG 350 

ACGTATTCTC TATCACTGGT CGTGGTACAG TTGCTTCAGG ACGTATCGAA 4 00 

10 CGTGGTACTG TTAAAGTTGG TGACGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

AGAAACTAAA AAAGCTGTTG TTACTGGTAT CGAAATGTTC CGTAAAACAC 500 

TTACTGAAGG TCTTGCTGGT GATAACGTCG GTGCACTTCT CCGTGGTATC 550 

CAACGTGACG AAATCGAACG TGGTCAAGTT ATTGCTAAAC CAGGTTCAAT 600 

CACTCCACAC AAACTTTTCG AAGGTGAAGT TTACGTATTG AGCAAAGAAG 650 

15 AAGGCGGACG TCACACTCCA TTCTTCGACA ACTACCGTCC TCAATTCTAC 700 

TTCCACACAA CTGACGTTAC TGGTTCAGTT AAACTTCCAG AAGGAAC TGA 750 

AATGGTAATG CCTGGTGACA ACGTGCATAT CGACGTTGAA TTGATCCACC 800 

CAGTTGCGAT CGAACAAGGT ACTAC 825 

20 



2) INFORMATION FOR SEQ ID NO: 110 

( i ) SEQUENCE CHARACTER I STI CS : 
25 (A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTTC ATCATCGTGT 100 

4 0 TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGTGAACT YCTGTCCCAG TACGACTTCC CGGGCGACGA 200 

CACCCCAATC GTTCGTGGTT CTGCG^TGAA AG~CGCTC^AA ~GGCG/^GCAg" ~ 2bU~ ' 

AGTGGGAAGA GAAAATCATC GT^RCTGGCTG GCTACCTGGA TTCCTACATC 3 00 

CCAGAGCCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 350 

45 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAARGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAGGAC 450 

ACTGCTAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCYATCAAG 600 

50 CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCYA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACKACTGA CGTGACCGGT ACCATCGARC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCAG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

CGCAATGGAC GATGGTCTGC GTTC 824 

55 

2) INFORMATION FOR SEQ ID NO: 111 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella micdadei 
10 (B) STRAIN: ATCC 33218 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 

CGGAGCGATA TTAGTAGTAT CAGCAG CGGA TGGCCCAATG CCTCAAACGA 50 

15 GAGAGCACAT ACTYTTATCC CGS CAGGTAG GTGTTCCCTA TATAGTAGTG 100 

TTCTTAAACA AAGCTGACAT GGTGGATGAT GCGGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTACGCGAYT TGTTGAGCAG CTATGAATTT CCAGGAGATG 200 

AGATCCCGAT TGTAGTTGGT TCAGCATTAA AAGCATTGGA AGGCGATACG 250 

AGTGATATAG GTGTACCAGC GATTGAGAAG TTAGTTGAGA CGATGGATTC 3 00 

2 0 TTATATACCT GAG C CGGTAA GAAACATCGA TAAAAGTTTC TTGTTACCGA 3 50 

TCGAAGACGT GTTCTCAATA TCTGGACGAG GAACAGTAGT AACAGGACGT 4 00 

ATCGAAAGCG GGATCATCAA AGTTGGTGAG GAAGTCGAGA TTGTTGGTAT 450 

ACGTGACACT CAAAAGACGA CATGCACAGG CGTTGAAATG TTCCGTAAAT 500 

TACTTGACGA AGGTCGAGCT GGAGACAACG TTGGTATATT GCTACGTGGT 550 

25 ACGAAGCGGG ATGAAGTTGA ACGCGGACAA GTATTAGCTA AGCCGGGAAG 600 

CATTAAACCG CATACTAAAT TTGAAGCTGA AGTGTATGTG TTGTCAAAAG 650 

ATGAAGGTGG ACGTCATACC CCATTCTTTA ACGGATATCG GCCTCAATTT 700 

TACTTCAGGA CCACAGACGT AACTGGTTCT TGTGATTTAC CTGARGGTAT 750 

AGAAATGGTA ATGCCAGGTG ATAACGTCAA GCTGATTGTT AGCTTACACT 800 

3 0 CACCGATTGC TATGGACGAA GGTTTGCGTT TTGCAATC 838 



35 



2 ) INFORMATION FOR SEQ ID NO: 112 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Legionella, pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 

50 CGGAGCGATA CTGGTTGTAT CAGCAGCTGA TGGTCCTATG CCACAAACGA 50 

GGGAACACAT TCTATTGTCT CGCCAGGTAG GTGTTC CATA TATTGTTGTG 100 

TTCATGAACA AAGCGGATAT GGTTGATGAC CCTGAGTTAT TAGAGTTAGT * s 150 

GGAAATGGAA GTGCGAGATT TATTAAGCAG TTACGATTTC CCAGGGGATG 2 00 

ACATACCTAT TGTTGTTGGT TCAGCTTTGA AAGCATTGGA AGGTGAAGAC 250 

55 AGTGATATAG GCGTTAAGGC TATTGAGAAA TTGGTTGAAA CAATGGATTC 3 00 

ATACATTCCT GAGCCAGTTA GAAACATAGA CAAGCCATTT TTGTTGC CGA 3 50 

TTGAAGACGT ATTTTCAATT TCTGGACGCG GAACAGTGGT AACTGGTCGT 400 

GTAGAGAGTG GAATTGTTAA AGTTGGTGAG GAAGTTGAAA TTGTTGGAAT 4 50 

AAGAGACACC CAAAAGACGA CTTGTACGGG TGTTGAGATG TTCCGTAAAT 500 

60 TACTTGATGA AGGTCGAGCT GGTGATAACG TTGGTGTGTT ATTACGAGGT 550 

68 
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ACGAAGCGAG ATGAAGTGGA GCGTGGACAG 
CAT CAAGC CA CACACCAAGT TTGAAGCAGA 
AAGAAGGCGG ACGTCACACT CCATTCTTTA 
TATTTCAGAA CCACTGACGT GACAGGTACT 
5 TGAAATGGTA ATGCCTGGAG ATAATGTGCA 
CTCCGATTGC GATGGATGAA GGTTTAAGAT 



GTATTGGCGA AGCCAGGAAC 600 

AGTGTATGTA TTATCCAAGG 650 

ATGGATACCG TCCACAATTC 700 

TGTGACTTGC CATCAGGAGT 750 

ATTAGTTGTT AGCTTGCATG 800 

TCGCAATT 838 



10 2 ) INFORMATION FOR SEQ ID NO: 113 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorel la grimontii 

(B) STRAIN: ATCC 33999 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 

GTGCAATCCT GGTAGTAGCA GCGACTGACG GCCCGATGCC TCAGACTCGC 50 

GAGCACATCC TGCTGGGTCG TCAGGTAGGC GTTCCGTACA TCATCGTATT 100 

CCTGAACAAG TGCGATATGG TTGATGACGA AGAGC TGCTG GAGCTGGTTG 150 

ARATGGAAGT TCGCGAACTG CTGTCTCAGT ACGACTTCCC GGGCGACGAC 2 00 

3 0 ACTC CGGTAG TCCGCGGTTC AGCGCTGAAA GCGCTGGAAG GCGAAGCCGA 250 

GTGGGAARCG AAAATCATCG AGCTGGCAGG CCMTCTGGAT ACTTATATCC 30 0 

CAGAAC CTGA GCGTGCGATT GACAAGCCGT TCCTGCTGCC KATCGAAGAC 350 

GTATTCTCTA TCTCCGGCCG TGGTACCGTT GTTACCGGTC GTGTAGAGCG 400 

CGGCATCATC AAAGTCGGTG AAGAAGTGGA AATCGTCGGT ATCAAAGATA 45 0 

35 CCACCAAGAC CACCTGTACC GGCGTTGAAA TGTTCCGTAA GCTGCTGGAC 500 

GAAGGCCGTG CGGGCGAGAA CGTGGGCGTT CTGCTGCGCG GTACCAAGCG 550 

TGACGAAATC GAACGTGGTC AAGTTCTGGC CAAGCCGGGC ACCATCACTC 600 

CTCACACCCA GTTCGTGTCA GAAGTGTATA TCCTGAGCAA GGATGAAGGC 650 

GGCCGTCATA CTCCGTTCTT CAAAGGCTAC CGTCCTCAGT TCTACTTCCG 700 

40 TACGACTGAC GTGACAGGCA C CATCGAACT GCCGGAAGGC GTAGAGATGG 750 

TAATGCCAGG CGACAACATT CAGATGACCG TAAGTCTGAT TGCGCCGATC 800 

GCAATGGACG ^GGTCTGCG" CTTCGCAA " ~ 



15 



20 



25 



45 



55 



2) INFORMATION FOR SEQ ID NO: 114 



( i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 826 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorel la richardii 

(B) STRAIN: ATCC 33998 

60 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 

69 
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GCTATCCTGG TTGTTGCTGC GACTGACGGC CCAATGCCTC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGCC AGGTAGGCGT TCCTTACATC ATCGTGTTCC 100 

TGAACAAGTG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 1.50 

5 ATGGAAGTTC GTGAACTTCT GTCTCAATAC GACTTCCCGG GCGACGATAC 2 00 

GCCGGTTGTT CGCGGTTCAG CGCTGAAAGC . GCTGGAAGGT GACGCYGAGT 250 

GGGAARCGAA AATCATTGAA CTGGCGGAAT CCTTRGATAC TTAYATTCCA 3 00 

GAGCCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTG CCTA TCGAAGACGT 350 

TTTCTCTATC TCTGGCCGTG GTACTGTAGT CACCGGTCGT GTAGAGCGCG 400 

10 GCATCATCAA AGTTGGTGAA GAAGTGGAAA TCGTGGGAAT CAAAGACACC 450 

ACCAAGACCA CCTGTACTGG CGTTGAAATG TTCCGTAAGC TGCTGGACGA 500 

AGGCCGTGCA GGTGAGAACG TTGGTGTTCT GCTGCGYGGT ACTAAGCGTG 550 

ACGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCAGGCAC CATCACTCCT 600 

CACACAGAAT TCGTGTCAGA AGTGTATATC CTGAGCAAGG ATGAAGGCGG 650 

15 YCGTCATACT CCGTTCTTCA AAGGCTACCG TCCTCAGTTC TACTTCCGTA 700 

CGACTGACGT GACCGGCACC ATCGAACTGC CAGAAGGCGT AGAGATGGTA 750 

ATGCCAGGCG ATAACATCCA GATGGTAGTT ACGCTGATTG CCCCAATCGC 800 

GATGGACGAA GGTCTGCGCT TCGCAA 826 

20 



2 ) INFORMATION FOR SEQ ID NO : 115 

( i ) SEQUENCE CHARACTER! STI CS : 
25 (A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 



3 0 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leptospira interrogans 

(B) STRAIN: ATCC 23 581 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 



TGCGGCGATT CTTGTAGTAT CCGCAACTGA CGGACCTATG CCACAAACAA 50 

AAGAACATAT CCTTCTTGCT CGTCAGGTAG GTGTTCCATA TGTAATTGTA 100 

40 TTCATTAACA AAGCAGATAT GCTTGCTGCT GACGAAAGAG CAGAAATGAT 150 

CGAAATGGTT GAGATGGACG TTCGTGAACT TCTCAATAAG TATAGCTTCC 200 

CAGGAGATAC AACTCCTj^cT ^tTlX^I^GGTT CTGCC^GTAAA AUCALTimG " Z btj" * 

GGCGATGAAT CTGAAATTGG GATGCCTGCA ATTCTCAAAT TGATGGAAGC 300 

TCTGGATACT TTCGTTCCAA ATC CAAAACG TGTAATCGAC AAACCTTTCC 350 

45 TTATGCCAGT AGAAGACGTT TTCTCGATCA CTGGTCGTGG AACTGTTGCA 4 00 

ACTGGAAGAG TGGAACAAGG TGTTTTGAAA GTGAACGACG AAGTTGAAAT 450 

TATCGGTATC CGCCCAACAA CAAAAACTGT TGTTACCGGT ATCGAAATGT 5 00 

TCAGAAAACT TCTCGATCAA GCGGAAGCTG GCGACAACAT CGGCGCTCTT 550 

CTTCGTGGAA CTAAAAAAGA AGAAATCGAA AGAGGGCAAG TTCTTGCGAA 600 

50 GCCAGGTTCT ATCACTCCTC ACAAAAAGTT TGCCGCTGAG GTGTATGTAT 650 

TAACTAAGGA TGAAGGCGGA CGTCATACTC CGTTTATCAA TAACTACCGT 700 

CCTCAGTTTT ACTTTAGAAC AACTGACGTA ACCGGAGTTT GTAACCTTCC 750 

TAATGGTGTC GAAATGGTTA TGCCTGGTGA TAACGTTTCT TTGACGGTTG 800 

AATTGATTAG CCCGATCGCA ATGGACAAGG GTCTTAAGTT CGC 843 

55 



2) INFORMATION FOR SEQ ID NO: 116 
60 (i) SEQUENCE CHARACTERISTICS: 

70 
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(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Megamonas hypermegale 
10 (B) STRAIN: ATCC 25560 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 

CGGTGCTATC CTCGTTGTTA GTGCTGCTGA TGGTCCTATG CCTCAGACTC 50 

15 GTGAACACAT CCTTCTCGCT CGTCAGGTTG GTGTTCCAGC TATCGTTGTA 100 

TTCCTCAACA AAGCTGAC C A GGTTGATGAC CCTGAACTTC TCGAACTTGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCCAG CTATGACTTC CCAGGCGATG 2 00 

ACGTTCCAGT AATCACTGGT TCCGCTCTTC AGGCTCTCGA AGGCGACGAA 250 

GAAGCTAAAA AGAAAATTCT TGAATTAATG GATGCTGTTG ATGATTACAT 3 00 

20 CCCAACTCCA ACACGTGACA CTGATAAACC TTTC TTAATG CCAGTTGAAG 3 50 

ACGTATTCAC AATTACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGCGAAC TTAAACTTGG TGACAGCGTT GAAATCGTTG GTCTTTCCGA 4 50 

TGAAAAGAAA TCCACTACTG TAACTGGTAT CGAAATGTTC CGCAAAATGC 500 
TTGATAGCGC TGTTGCTGGT GATAACATCG GTGCACTTCT TCGTGGTATT ' 550 

25 GACCGTAAAG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CTGGCACAAT 600 

TCATCCACAC AAAAAATTCA AAGCTCAGGT TTACGTATTA ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCTCCA ACTATCGTCC ACAGTTCTAT 7 00 

TTCCGTACTA CTGACGTTAC TGGTGTTGTA ACTCTTCCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACATTGAAAT GAGCATCGAA CTCATCACTC 800 

3 0 CAATCGCTAT TGAAAAAGGT CTTCGCTTCG CT 832 



35 



2) INFORMATION FOR SEQ ID NO: 117 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
4 0 <D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Mitsuokella multacida 

(B) STRAIN: ATCC 27723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 

50 TGGTGCTATC CTCGTCGTTT CCGCTGCTGA TGGCCCGATG CCGCAGACGC 50 

GTGAGCACAT CCTGCTCGCT CGCCAGGTCG GTGTTCCGGC AATCGTTGTC 100 

TTCCTCAACA AGGTTGACCA GGTTGACGAT CCGGAGCTCC TCGAGCTCGT 150 

CGAGATGGAA GTTCGCGAGC TGCTCTCCAG CTACGACTTC CCGGGCGATG 200 

ACATCCCTGT AATCGCTGGT TCCGCTCTGA AGGCCCTCGA AGGCGACGAA 2 50 

55 GAGCAGAAGA AGAACATCCT CAAGCTCATG GAAGCTGTCG ATGAGTACAT 3 00 

CCCGACGCCG GTCCGCGACA ACGCTAAGCC GTTCCTGATG CCGGTCGAGG 350 

ATGTCTTCAC GATCACGGGC CGTGGTACGG TTGCAACGGG CCGCGTTGAG 4 00 

CGTGGTGAGC TCAAGATGAA CGATACGGTT GAGATCGTTG GTCTGCAGGA 450 

CGAGCCGCGT CAGACGGTTG TCACGGGCAT CGAGATGTTC CGCAAGATGC 500 

60 TTGATTTCGC TGAGGCTGGC GATAACATCG GTGCTCTGCT CCGTGGTATC 550 
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GACCGCAAGG AGATCGAGCG TGGCCAGGTT 
TCATCCGCAC ACGAAGTTCA AGGCTCAGGT 
AAGGCGGCCG TCATACGCCG TTCTTCACGA 
TTCCGCACGA CGGACGTAAC TGGCGTAGTC 
5 GATGGTTATG CCTGGCGATA ACGTCGAGAT 
CGATCGCTAT CGAGAAGGGC 



CTCGCAAAGC CGGGCACGAT 600 
CTATGTCCTG ACGAAAGAAG 650 
ACTATCGCCC GCAGTTCTAC 700 
AAACTGCCGG AAGGCACGGA 750 
GGAAGTTGAG CTCATCACCC 8 00 

820 



10 2) INFORMATION FOR SEQ ID NO r 118 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mobiluncus curtisii subsp. holmes ii 

(B) STRAIN: ATCC 3 5242 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 

25 

CGGCGCTATC CTCGTGGTGG CTGCTACTGA CGGTCCGATG GCTCAGACCA 50 

AGGAACACAT CCTGTTGGCT AAG CAGGTTG GCGTGCCCTC CATCCTGGTC 100 

GCTCTGAACA AGTGCGATTC TTC CGATGTG GACGAAGACA TGCTCGAAAT 150 

CGTCGAGGAC GAAATCCGCG ATGACCTGGA GAAGCAGGGC TTCGATCGTG 2 00 

3 0 AC TGCC CGAT TATCCACGTT TCCGCTCTGA AGGCCCTGGA AGGCGACCCC 250 

GAGTGGACCA AGAAGATTGA AGAGCTCATG GAAGCGGTCG ATACCTACAT 300 

TCCTGAGCCT GTTCGTGACC TCGACAAGCC GTTCTTGATG CCTATCGAAG 350 

ACGTCTTCAC CATTACTGGT CGCGGTACCG TAGTGACCGG TCGTGTGGAA 400 

CGCGGCAAGC TACCGTTGAA CGCCGAAGTG GAAATCGTAG GTATTCGTCC 450 

35 TACGCAAAAG ACCACCGTTA CCGGTATCGA AATGTTCCAC AAGTCCATGG 500 

ACGAAG CCTA CGCCGGCGAG AACTGTGGTC TGTTGCTGCG TGGCACCAAG 550 

CGTGAGGACG TTGAGCGCGG TCAGGTTGTC TGCATTCCTG GCTCCGTGAC 600 

CCCGCACACC AAGTTCGAGG GCAAGGTCTA CATCTTGAAG AAGGACGAAG 650 

GTGGACGTCA CAAGTCGTTC TACGACGGCT ACCGCCCGCA GTTCTTCTTC 700 

4 0 CGCACCACCG ACGTGACCGG TGTTATTCAC CTGCCCGAAG GCACCGAAAT 750 

GGTTATGCCT GGCGACACCA CCGAAATTAG CGTTGAGCTG ATTCAGCCTA 800 

TCGCTATGGA GGAAGGTCTcT GGCTTCGC1 ; a""t ~ 8jfl" 



45 



55 



2) INFORMATION FOR SEQ ID NO: 119 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 5 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella. wisconsensis 

(B) STRAIN: ATCC 35017 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 

72 
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GGTGCAATTC TGGTTGTTGC TGCAACTGAT GGCCCTATGC CACAGACTCG 50 

TGAGCACATC CTGTTAGGTC GTCAGGTTGG CGTTCCATAC ATCATCGTTT 100 

TCCTGAACAA ATGTGACATG GTAGACGACG AAGAGCTGTT AGAACTGGTT 150 

5 GAAATGGAAG TCCGTGAGCT GCTGTCTCAG TACGATTTCC CAGGCGATGA 200 

CACTCCAGTA ATCCGTGGTT CAGCGCTGAA AGCTCTGGAA GGCGAAGCTG 250 

AGTGGGAAGC TAAAATCATT GAACTGGCAG AAGCACTGGA TTCTTATATC 300 

CCAGAGCCAG AGCGTGACAT TGATAAGCCA TTCCTGTTAC CAATCGAAGA 350 

CGTATTCTCA ATTTCAGGCC GTGGTACAGT TGTTACTGGT CGTGTTGAGC 400 

10 GTGGTATCGT TAAAGTCGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAT 450 

ACCGTGAAAA CAACATGTAC TGGCGTTGAA ATGTTCCGTA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAAC 550 

GTGATGATAT CGAACGTGGT CAAGTATTGG CTAAACCAGG TTCAATCACT 600 

CCGCATACAA CTTTCGAATC AGAAGTTTAC AT CCTGAGCA AAGATGAAGG 650 

15 TGGCCGTCAT ACTCCATTCT TCAAAGGTTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTAACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCAG GTGATAACAT CAAAATGATC GTTACTCTGA TCCACCCAAT 800 

TGCAATGGAT GCAGGTCTGC GTTTT 82 5 

20 



2) INFORMATION FOR SEQ ID NO: 12 0 

( i ) SEQUENCE CHARACTERI STI CS : 
25 (A) LENGTH: 82 7 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Branhamella catarrhal! s 

(B) STRAIN: ATCC 43628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 



TGGTGCTATC TTGGTTGTTT CTGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCATAT CCTACTATCT - CGTCAGGTTG GTGTACCATA CATCATGGTA 100 

• 40 TTCATGAACA AGTGCGATAT GGTTGATGAT GAAGAGCTAC TAGAATTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTATCTGA CTATGATTTC CCTGGTGATG 20 0 

ATACCCCAAT CATCAAAGGT TCAGCACTAG AAGCA^TrGAA TGUl'l^lXiAY ~ 2btf " 

GGTAAATATG GCGAGCCTGC AGTTCTAGAA CTGCTAGACA CACTAGACAG 300 

CTATATCCCA GAGCCTGAGC GTGATAT CGA TAAGTCATTC TTGATGCCAA 350 

45 TTGAAGATGT CTTCTCGATC TCAGGTCGTG GTACAGTTGT GACTGGTCGT 400 

GTTGAATCAG GTATTATTAA AGTTGGTGAT GAAATTGAAA TCATCGGTAT 450 

CAAACCAACT GCTAAAACCA CCTGTACTGG TGTTGAAATG TTCCGTAAAC 500 

TGTTAGACGA AGGTCGTGCA GGTGAGAACT GTGGTATCTT GTTGCGTGGT 550 

ACTAAGCGTG AAGAAGTTCA ACGCGGTCAA GTACTTGCAA AACCAGGTTC 600 

50 AATCACCCCA CATACTAAGT TTGATGCTGA AGTTTATGTA CTGTCAAAAG 650 

AAGAAGGTGG TCGTCACACC CCATTCTTAA ATGGCTATCG CCCACAGTTC 700 

TACTTC CGTA CCACAGATGT GACTGGTGCC ATCACTCTAC AAGAAGGTAC 750 

CGAAATGGTT ATGCCTGGTG ACAATGTTGA GATGAGTGTT GAGCTTATCC 800 

ACCCAATCGC CAGGATAAAG GTCTACG 82 7 

55 



2 ) INFORMATION FOR SEQ ID NO: 121 
60 (i) SEQUENCE CHARACTERISTICS: 

73 
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(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

{ i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp. morganii 
10 (B) STRAIN: ATCC 25830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

CGGCGCTATC CTGGTTGTTG CTGCAACTGA TGGC CCTATG CCACAGACCC 50 

15 GTGAGCACAT CCTGTTAGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTA 100 

TTCCTGAACA AATGTGACAT GGTTGATGAT GAAG AG CTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCTGGCGACG 200 

ACACGCCAAT CGTTCGCGGT TCAGCGCTGA AAGCACTGGA AGGCGAGCCA 250 

GAGTGGGAAG CTAARATCGT TGAACTGGCA GGTTTCCTGG ATTCTTACAT 3 00 

2 0 CCCTGAGCCA GAGCGTGCAA TTGACAAGCC GTTCCTGCTG CCAATCGAAG 3 50 

ACGTATTCTC AATCTCCGGC CGTGGTACCG TTGTTACCGG TCGTGTTGAG 400 

CGCGGTATCA TCAAGGTTGG TGAGGAAGTT GAAATCGTGG GTATCAAAGA 450 

TACTGCGAAA ACCACCTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCMGGTGAG AACGTCGGTG TTCTGCTGCG TGGTACCAAG 550 

25 CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAACCAG GTTCAATCAA 600 

ACCACAYACC AAATTTGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGYT ACCGTCCACA GTTCTACTTC 700 

CGTACCACAG ACGTAACAGG TAC TATCGAA CTGCCGGAAG GCGTTGAAAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGAT CGTCACCCTG ATCCACCCAA 800 

3 0 TCGCAA 806 



35 



2 ) INFORMATION FOR SEQ ID NO: 122 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: TB 2 99 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 

50 GGTGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CCCAGACCCG 50 

CGAGCACGTT CTGCTGGCGC GTCAAGTGGG TGTGCCCTAC ATCCTGGTAG 100 

CGCTGAACAA GGCCGACGCA GTGGACGACG AGGAGCTGCT CGAACTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCTGCC CAGGAATTCG ACGAGGACGC 2 00 

CCCGGTTGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGT GACGCGAAGT 2 50 

55 GGGTTGCCTC TGTCGAGGAA CTGATGAACG CGGTCGACGA GTCGATTCCG 3 00 

GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCGG TCGAGGACGT 350 

CTTCACCATT ACCGGCCGCG GAACCGTGGT CACCGGACGT GTGGAGCGCG 4 00 

GCGTGATCAA CGTGAACGAG GAAGTTGAGA TCGTCGGCAT TCGCCCATCG 450 

ACCACCAAGA CCACCGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 500 

60 CCAGGGCCAG GCGGGCGACA ACGTTGGTTT GCTGCTGCGG GGCGTCAAGC 550 
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GCGAGGACGT CGAGCGTGGC CAGGTTGTCA 

CCGCACACCG AGTTCGAAGG CCAGGTCTAC 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA 

GCACCACCGA CGTGACCGGT GTGGTGACAC 

5 GTGATGCCCG G TGACAACAC CAACATCTCG 

CGCCATGGAC GAAGGTCTGC GTTTC 



CCAAGCCCGG CACCACCACG 600 
ATCCTGTCCA AGGACGAGGG 650 
CCGTCCGCAG TTCTACTTCC 700 
TGCCGGAGGG CAC CGAGATG 750 
GTGAAGTTGA TCCAGCCCGT 800 

825 



10 2 ) INFORMATION FOR SEQ ID NO: 123 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 

(B) STRAIN: ATCC 14685 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 



CGGTGCGATC TTGGTATGTT CCGCAGCTGA CGGTCCTATG CCGCAAACTC 50 

GCGAACACAT CCTGTTGGCC CGCCAAGTAG GTGTACCTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTTGACGAT GCCGAGCTGT TGGAGCTGGT 150 

TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCAGGTGACG 200 

30 ACTGCCCGAT CGTACAAGGT TCTGCACTGA AAGCCTTGGA AGGCGACGCA 250 

GCTTACGAAG AAAAAATCTT CGAATTGGCT GCTGCATTGG ACAGCTACAT 3 00 

CCCAACACCT GAGCGTGCAG TGGACAAACC TTTCTTGTTG CCTATCGAAG 350 

ACGTATTCTC TATTTCCGGT CGCGGTACAG TAGTAACCGG TCGTGTAGAG 400 

CGCGGTATCA TCCACGTTGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 450 

35 AACTCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCA AGCTGGTGAC AACGTAGGTG TATTGCTGCG TGGTACTAAA 550 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG GCTAAACCGG GTACTATCAC 600 

TCCTCACACC AAGT TCAAAG CAGAAGTATA CGTACTGAGC AAAGAAGAGG 650 

GTGGTCGTCA CACTCCGTTC TTCGCTAACT ACCGTCCACA ATTCTACTTC 700 

40 CGTACTACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GTGTAGAAAT 750 

GGTAATGCCG GGTGAGAACG TAACCATTAC TGTAGAACTG ATTGCGCCTA 800 

tcgcta" yo Y~ 



45 



55 



2) INFORMATION FOR SEQ ID NO: 124 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 822 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria, elongata subsp. elongata 

(B) STRAIN: ATCC 25295 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 

75 
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. CGGCGCAATC TTGGTATGTT CCGCTGCYGA CGGTCCTATG CCGCAAACTC 50 

GCGAACACAT CCTGTTGGCC CGCCAAGTAG GCGTACCTTA CATCATCGTG 100 

TTCATGAATA AATGCGACAT GGTTGAYGAT GCCGAACTGC TGGAACTGGT 150 

5 TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCAGGCGACG 2 00 

ACTGCCCGAT CGTACAAGGT TCCGCACTGA AAGCCTTGGA AGGCGACGCA 2 50 

GCTTACGAAG AAAAAATCTT CGAACTGGCT GCTGCATTGG ACAGCTACAT 3 00 

CCCGACACCT GAGCGTGCCG TGGACAAACC GTTCCTGTTG CCTATCGAAG 3 50 

ACGTATTCTC TATCTCCGGC CGTGGTACAG TAGTAACCGG TCGTGTAGAG 4 00 

10 CGCGGTATCA TCCACGTCGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 4 50 

AACCCAAAAA AC C ACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCA AGCAGGTGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 550 

CGTGAAGAAG TGGAACGCGG TCAAGTATTG GCTAAACCGG GTACCATCAC 6 00 

TCCTCACACC AAATTCAAAG CAGAAGTTTA CGTATTGAGC AAAGAAGAGG 650 

15 GTGGTCGTCA TACTCCGTTC TTCGCTAACT ACCGTCCACA ATTCTACTTC 700 

CGTACTACCG ACGTAACCGG TGCGGTTACT TTGGAAGAAG GTGTAGAAAT 750 

GGTTATGCCT GGTGAGAACG TGGCCATCAC TGTAGAACTG ATTGCACCTA 8 00 

TCGCTATGGA AGAAGGTCTG CG 822 



20 



35 



2 ) INFORMATION FOR SEQ ID NO: 125 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 13120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 



CGGCGCGACT TGGTATGTTC CGCAGCTGAC GGTCCTATGC CGCAAACCCG 50 
CGAACACATC CTGTTGGCTC GCCAAGTAGG TGTACCTTAC ATCATCGTAT 100 
4 0 TCATGAACAA ATGCGACATG GTAGACGATG CCGAGCTGTT GGAACTGGTT 150 
GAAATGGAAA TTCGTGACTT GTTGTCAAGC TACGACTTCC CAGGCGACGA 2 00 

ctgcccaatcT gtI^'aa'ggYF^tc "agcu^'tggaa ggi^TcJuuxT ~ A blf ~ 

CTTACGAAGA AAAAATCTTC GAATTGGCTG CTGCCTTGGA CAGCTACATC 3 00 

CCAACACCTG AGCGTGCTGT GGACAAACCT TTCTTGTTGC CTATCGAAGA 350 

45 CGTATTCTCT ATCTCTGGTC GTGGTACAGT AGTAACCGGT CGCGTAGAGC 4 00 

GCGGTATCAT CCACGTTGGT GACGAGATCG AAATCGTAGG TCTGAAAGAA 450 

ACTCAAAAAA CCACTTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 5 00 

CGAAGGTCAA GCAGGTGACA ACGTAGGCGT ATTGCTGCGT GGTACTAAAC 550 

GTGAAGACGT AGAGCGTGGT CAAGTATTGG CTAAACCAGG TACCATCACT 600 

50 CCTCACACCA AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG 650 

TGGTCGTCAC ACTCCATTTT TCGCTAACTA CCGTCCACAA TTCTACTTCC 700 

GTACTACCGA CGTAACTGGT GCAGTTACTT TGGAAGAAGG CGTAGAAATG 7 50 

GTAATGCCAG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCAAT 800 

CGCTATGGAA GAAGTCTGCG 82 0 

55 

2) INFORMATION FOR SEQ ID NO: 126 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA . 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 
10 (B) STRAIN: ATCC 49226 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 

GGTGCAATCC TGGTATGTTC TGCTGCCGAC GGCCCTATGC CGCAAACCCG 50 

15 CGAACACATC CTGCTGGCCC GTCAAGTAGG CGTACCTTAC ATCATCGTGT 100 

TCATGAACAA ATGCGACATG GTCGACGATG CCGAGCTGTT GGAACTGGTT 150 

GAAATGGAAA TCCGCGACCT GCTGTCCAGC TACGACTTCC CCGGCGACGA 200 

CTGCCCGATC GTACAAGGTT CCGCACTGAA AGCCTTGGAA GGCGATGCCG 250 

CTTACGAAGA AAAAATCTTC GAACTGGCTA CCGCATTGGA CAGCTACATC 3 00 

2 0 CCGACTCCCG AGCGTGCCGT GGACAAACCA TTCCTGCTGC CTATCGAAGA 350 

CGTGTTCTCC ATTTCCGGCC GCGGTACCGT AGTCACCGGC CGTGTAGAGC 4 00 

GAGGTATCAT CCACGTTGGT GACGAGATTG AAATCGTCGG TCTGAAAGAA 450 

ACCCAAAAAA CCACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGTCAG GCGGGCGACA ACGTAGGCGT ATTGCTGCGC GGTACCAAAC 550 

25 GTGAAGACGT AGAACGCGGT CAGGTATTGG CCAAACCGGG TACTATCACT 600 

CCTCACACCA AGTTCAAAGC AGAAGTGTAC GTATTGAGCA AAGAAGAGGG 650 

CGGCCGCCAT ACCCCGTTTT TCGCCAACTA CCGTCCCCAA TTCTACTTCC 700 

GTACCACTGA CGTAACCGGC GCGGTTACTT TGGAAAAAGG TGTGGAAATG 750 

GTAATGCCGG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCTAT 800 

3 0 CGCTATGGAA GAAGGTCTGC GCTTTGCGAT 83 0 



35 



2 ) INFORMATION FOR SEQ ID NO: 12 7 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

( i i ) MOLECULE "TYPE :~ Genomi c DNA~~ 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 

50 CGGCGCAATC TTGGTATGTT CCGCCGCCGA CGGCCC TATG CCGCAAACCC 50 

GCGAACACAT TCTGTTGGCC CGCCAAGTAG GTGTACCTTA CATCATCGTA 100 

TTCATGAACA AATGCGATAT GGTCGACGAT GCCGAGCTGT TGGAACTGGT 150 

TGAAATGGAA ATCCGCGACC TGCTGTCAAG CTACGACTTC CCAGGCGACG 200 

ACTGCCCAAT CGTACAAGGT TCCGCACTGA AAGCTTTGGA AGGCGATGCC 250 

55 GCTTACGAAG AAAAAATCTT CGAACTGGCT GCCGCATTGG ACAGCTACAT 3 00 

CCCGACTCCC GAGCGTGCCG TGGACAAACC GTTCCTGCTG CCTATCGAAG 350 

ACGTATTCTC CATCTCCGGC CGCGGTACGG TAGTAACCGG CCGTGTAGAG 4 00 

CGCGGTGTCA TCCACGTTGG CGACGAGATC GAAATCGTCG GTC TGAAAGA 4 50 

AACCCAAAAA ACCACCTGTA CCGGTGTCGA GATGTTCCGC AAACTGCTGG 500 

60 ACGAAGGTCA GGCAGGCGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 550 

77 
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CGTGAAGAAG TGGAACGCGG TCAGGTATTA GCCAAACCGG GTACCATCAC 600 

TCCGCACACC AAGTTCAAAG CAGAAGTGTA TGTATTGAGC AAAGAAGAGG 650 

GCGGTCGTCA CACTCCGTTC TTCGCCAACT ACCGTCCGCA ATTCTACTTC 700 

CGTACCACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GCGTGGAAAT 750 

GGTAATGCCC GGTGAGAACG TAACCATTAC TGTAGAACTG ATTGCGCCTA 800 

TCGCTATGGA AGAAGG 816 



10 2) INFORMATION FOR SEQ ID NO: 128 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
15 (C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE : 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: ATCC 13077 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 



CGGTGCAATC CTGGTATGTT CCGCAGCCGA CGGTCCTATG CCGCAAACCC 50 

GCGAACACAT CCTGCTGGCC CGTCAAGTAG GCGTACCTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTCGACGAT GCCGAGCTGT TGGAACTGGT 150 

TGAAATGGAA ATCCGCGACC TGCTGTCCAG CTACGACTTC CCCGGCGACG 200 

30 ACTGCCCGAT CGTACAAGGT TCCGCACTGA AAGCCTTGGA AGGCGATGCC 250 

GCTTACGAAG AAAAAATCTT CGAATTGGCT GCTGCATTGG ACAGCTACAT 3 00 

CCCGACTCCC GAGCGTGCCG TGGACAAACC TTTCTTGTTG CCTATCGAAG 350 

ACGTATTCTC TATTTCCGGT CGTGGTACAG TAGTAACCGG TCGTGTAGAG 40 0 

CGCGGTATCA TCCACGTCGG TGACGAGATC GAAATCGTCG GTCTGAAAGA 45 0 

35 AACTCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 50 0 

ACGAAGGTCA AGCAGG CGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 550 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG GCTAAACCGG GTACAATCAC 60 0 

TCCTCACACC AAGTTCAAAG CAGAAGTATA CGTACTGAGC AAAGAAGAGG 65 0 

GCGGCCGCCA TACCCCGTTC TTCGCCAACT ACCGTCCCCA ATTCTACTTC 70 0 

4 0 CGTACCACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GTGTGGAAAT ' 750 

GGTAATGC CG GGCGAGAACG TAACCATCAC CGTAGAACTG ATTGCGCCTA 800 

tcgctatgga" "agaaggtttcT cgcu^gcga" T " a j± " 



45 
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2) INFORMATION FOR SEQ ID NO: 129 



(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 815 bases 
50 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria mucosa, 

(B) STRAIN: ATCC 19696 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 

78 
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CGGCGCAATC TTGGTATGTT CTGCTGCYGA CGGTCCTATG CCGCAAACCC 50 

GYGAACACAT CCTGTTGGCC CGTCAAGTAG GYGTAC CTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTTGACGAT GCCGAAYTGT TGGAACTGGT 150 

5 TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCTGGYGACG 2 00 

ACTGCCCGAT TGTACAAGGT TCTGCACTGA AAGCCTTGGA AGGCGATGCC 250 

GCTTACGAAG AAAAAATCTT CGAACTGGCT GCCGCATTGG ACAGCTACAT 300 

CCCGACTCCC GAGCGTGCCG TAGACAAACC GTTCCTGTTG CCTATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACAG TAGTAACCGG CCGTGTAGAG 400 

10 CGCGGTGTTA TCCACGTTGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 4 50 

AACCCAAAAA ACCACATGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCA AGCCGGTGAC AACGTAGGCG TATTGCTGCG CGGTAC CAAA 550 

CGTGAAGAAG TGGAACGCGG TCAAGTATTG GCTAAACCGG GTACCATCAC 600 

TCCGCACACC AAATTCAAAG CAGAAGTGTA CGTATTGAGC AAAGAAGAGG 650 

15 GTGGTCGTCA TACTCCGTTC TTCGCTAACT ACCGTCCTCA ATTCTACTTC 700 

CGTACTACCG ACGTAACCGG TGCGGTTACT TTGGAAGAAG GTGTAGAAAT 750 

GGTTATGCCT GGTGAGAAYG TAGCCATYAC TGTAGAACTG ATTGCGCCTA 800 

TYGCTATGGA AGAAG 815 

20 



2 ) I NFORMAT I ON FOR SEQ ID NO: 13 0 

(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 



GGCGCAATCT TGGTATGTTC CGCTGCTGAC GGTC CTATGC CGCAAACCCG 50 
CGAACACATC CTGTTGGCCC GCCAAGTAGG CGTACCTTAC ATCATCGTGT 100 
40 TCATGAACAA ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT 150 
GAAATGGAAA TCCGTGACTT GCTGTCAAGC TACGACTTCC CTGGTGACGA 2 00 

CTGCCCGATC GTACAAGGTT CTGCACTCM AGCCTTGGAA GGCaCGCCG""" 250~" 

CTTACGAAGA AAAAATCTTC GAACTGGCTG CTGCATTGGA CAGCTACATC 3 00 

CCGACTCCTG AGCGTGCCGT GGACAAACCG TTCCTGTTGC CTATTGAAGA 3 50 

45 CGTATTCTCC ATCTCCGGTC GCGGTACCGT AGTAACCGGC CGTGTAGAGC 4 00 

GCGGTGTTAT CCACGTTGGT GACGAGATTG AAATCGTAGG TCTGAAAGAA 4 50 

ACCCAAAAAA CCACTTGTAC CGGTGTTGAA ATGTTCCGCA AACTGCTGGA 5 00 

CGAAGGTCAA GCCGGTGACA ACGTAGGCGT ATTGCTGCGC GGTACCAAAC 550 

GTGAAGAAGT GGAACGCGGT CAAGTATTGG CTAAACCGGG TACCATCACT 600 

50 CCTCACACTA AATTCAAAGC AGAAGTTTAC GTATTGAGTA AAGAAGAGGG 650 

TGGTCGTCAT ACTCCGTTCT TCGCTAACTA CCGTCCTCAA TTCTACTTCC 7 00 

GTACTACCGA CGTAACCGGC GCGGTTACTT TGGAAGAAGG TGTAGAAATG 750 

GTTATGCCTG GTGAGAACGT AGCCATCACT GTAGAACTGA TTGCACCGAT 8 00 

CGCTATGGAA GAAGGTCTGC GCTTTGCGA 829 

55 



2 ) INFORMATION FOR SEQ ID NO: 131 
60 (i) SEQUENCE CHARACTERISTICS: 

79 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/0115O 



(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sub f lava 
10 (B) STRAIN: ATCC 14221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 

CGGCGCGACT TGGTATGTTC CGCAGCTGAT GGTCCTATGC CTCAAACTCG 50 

15 CGAACACATC CTGTTGGCTC GCCAAGTAGG TGTACCTTAC ATCATCGTAT 100 

TCATGAACAA ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT 150 

GAAATGGAAA TCCGTGACCT GTTGTCAAGC TACGACTTCC CAGGCGACGA 200 

CTGCCCAATC GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGTGACGCTG 2 50 

GTTACGAAGA GAAAATCTTC GAATTGGCTG CTGCTCTGGA CAGCTACATC 3 00 

2 0 CCAACACCTG AGCGTGCTGT GGACAAACCT TTCTTGTTGC CTATC GAAGA 350 

CGTATTCTCT ATCTCTGGCC GTGGTACAGT AGTAACTGGT CGTGTAGAGC 4 00 

GCGGTAT CAT CCACGTTGGT GACGAGATCG AAATCGTAGG TCTGAAAGAA 450 

ACCCAAAAAA CCACTTGTAC CGGCGTTGAA ATGTTC CGCA AACTGCTGGA 500 

CGAAGGTCAA GCTGGTGACA ACGTAGGCGT ATTGCTGCGT GGTACCAAAC 550 

25 GTGAAGACGT AGAGCGTGGT CAAGTATTGG CTAAAC C AGG TACCATTACT 600 

CCTCACACCA AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG 650 

TGGTCGTCAC ACTCCATTCT TCGCTAACTA CCGTCCACAA TTCTACTTCC 700 

GTACTACTGA CGTAACTGGT GCAGTTACTT TGGAAGAAGG CGTAGAAATG 750 

GTAATGCCAG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCTAT 8 00 

30 CGCTATGGAA GAAG 814 



2) INFORMATION FOR SEQ ID NO: 132 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii') iyioLECULE TYPeY" Genomic DNA~~ 

( vi ) ORIGINAL SOURCE: 
45 (A) ORGANISM: Neisseria weaveri 

(B) STRAIN: ATCC 51223 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 

50 GCCATCTTGG TATGTTCTGC TGCTGACGGT CCTATGCCGC AAACCCGTGA 50 

GCACATCCTG TTGGCTCGTC AAGTAGGTGT ACCCTACATC ATCGTATTCA 100 

TGAACAAATG CGATATGGTT GATGATGCAG AGCTGCTGGA ATTGGTAGAA 150 

ATGGAAATCC GTGATCTGCT GAGCAGCTAC GATTTCCCTG GCGATGATTG 200 

YCCAATCGTG CAAGGTTCTG CTTTGAAAGC TTTGGAAGGT GATGCCGCTT 250 

55 ACGAAGAAAA AATCTTTGAA TTAGCTGCTG CATTGGATAG CTATATTCCA 3 00 

ACWCCTGAGC GYGCTGTTGA TAAACCATTC CTGTTGCCGA TTGAAGATGT 350 

ATTCTCAATT TCAGGTCGTG GTACAGTAGT AACTGGTCGT GTAGAGCGCG 4 00 

GTATTATTCA CGTAGGCGAT GAAATTGAAA TTGTAGGTTT GAAAGARACY 450 

CAAAAAACTA CTTGTACCGG CGTTGAAATG TTCCGTAAAT TGCTGGATSA 5 00 

60 AGGTCAGGCT GGTGATAACG TAGGCGTATT GTTGCGTGGT ACCAAACGTG 550 

80 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



AAGACGTTGA • GCGTGGTCAA GTATTGGCTA 
CAYACCAAAT TCAAAGCAGA RGTKTATGTW 
TCGTCATACT CCGTTCTTCG CTAACTATCG 
CTACAGACGT TACCGGTGCK GTRACTTTAG 
5 ATGC CTGGTG AGAAYGTTGC CATTACTGTW 
KATGGAAGAA GGYTGCGT 



AGCCTGGTWC TATTACTCCG 60 0 

TTGAGYAAGG AAGAAGGCGG 650 
TCCGCAATTC TATTTCCGTA 700 
AAGAAGGTGT GGAAATGGTA 750 
GARYTGATYG CTCCGATTGC 800 

818 



10 2 ) INFORMATION FOR SEQ ID NO: 133 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ochrohac truxn anthropi 

(B) STRAIN: ATCC 49188 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 



CGGCGCAATT CTGGTTGTTT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTCGCT CGTCAGGTTG GCGTTCCGGC AATCGTCGTG 100 

TTCCTGAACA AGTGCGACCA GGTTGACGAT GCAGAACTGC TCGAACTGGT 150 

TGAACTGGAA GTTCGCGAAC TTCTGTCGAA ATACGATTTC CCGGGCGACG 200 

30 AAGTTCCGAT CATCAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCG 250 

AAGGAACTGG GCGAAGACGC CGTTCGTTCG CTGATGGCCG CTGTTGACGA 3 00 

CTACATTCCG ACCCCGGAAC GTCCGATCGA CCAGCCGTTC C TGATGC CGA 3 50 

TCGAAGACGT TTTCTCGATC TCGGGCCGTG GTACGGTTGT GACGGGT CGC 400 

GTTGAGCGCG GTATCGTCAA GGTTGGTGAA GAAGTTGAAA TCGTCGGCAT 4 50 

35 CAAGGCGACG GCGAAGACGA CGGTAACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTCGAYCA GGGCCAGGCT GGCGACAACA TCGGCGCTCT GATCCGCGGC 550 

GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGGGTTC 600 

TGTGAAGCCG CACACCAAGT TCAAGGCAGA AGCCTACATT CTGACCAAGG 650 

ACGAAGGTGG CCGTCATACG CCGTTCTTTA CGAACTACCG TCCGCAGTTC 70 0 

40 TACTTCCGCA CGACGGACGT GACCGGTGTT GTCACGCTGC CGGAAGGCAC 750 

GGAAATGGTT ATGCCTGGCG ACAACGTCGC TATGGACGTC ACCCTGATCG 80 0 

TGCCGATCGC "CATGGAAGAG" ^G^CTC^GCt'^C^CtTT" 3 J~b ~ 



45 



55 



2) INFORMATION FOR SEQ ID NO: 134 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 8 05 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 

(B) STRAIN: ATCC 27155 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 

81 



\ 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CCTGGTTGTT GCTGCGACTG ATGGCCCAAT GCCACAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT AGATCATCGT GTTCCTGAAC 100 

AAGTGTGACA TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAGATGGA 150 

5 AGTACGTGAC CTGCTGTCAC AGTACGACTT CCCAGGCGAT GACACCCCGA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGTTCC TGAGTGGGAA 250 

GCAAAAATCG TTGAGCTGGC TGAACACCTG GACAACTACA TCCCGGATCC 30 0 

AGTCCGTGCG ATCGACATGC CGTTCCTGCT GCCAATCGAA GACGTATTCT 350 

CAATCTCTGG CCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGCATC 40 0 

10 GTTAAAGTCG GCGACGAAGT TGAAATCGTG GGTATCAAAG ATACTGCGAA 450 

ATCAACCTGT ACCGGTGTTG AGATGTTCCG TAAGCTGCTG GACCAGGGTC 500 

AGGCAGGCGA AAACTGTGGT GTTCTGCTGC GCGGTATCAA GCGTGAAGAC 55 0 

ATCCAGCGTG GCCAGGTTCT GGCTAAGCCA GGCTCAATCA AGCCGCACAC 600 

CCAGTTCGAG TCAGAAGTTT ACGTTCTGTC TAAAGACGAA GGTGGCCGCC 650 

15 ATACTCCGTT CTTCAAAGGC TATCGTCCAC AGTTCTACTT CCGTACAACT 70 0 

GATGTAACCG GTTCAGTAGA GCTGCCAGAA GGCGTTGAGA TGGTCATGCC 750 

AGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCA ATCGCAATGG 800 

ACGAA 805 

20 



2) INFORMATION FOR SEQ ID NO: 135 

( i ) SEQUENCE CHARACTER I S T I CS : 
25 (A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea. di spersa 

(B) STRAIN: ATCC 145 8 9 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 



CGCGATCCTG GTTGTTGCTG CGACTGATGG CCCAATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGCCGT CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

40 CTGAACAAGT GTGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

GATGGAAGTT CGCGATCTGC TGTCTCAGTA CGACTTCCCA GGCGACGATA 200 

CCCC^TCGT" ACGCGG^^ CGL r i^AAGG~ CciACGC^lX^TJ " Z blf ~ 

TGGGAAGCGA AAGTCGTTGA GCTGGCTGGT CACCTGGATA CTTACATTCC 300 

AGATCCAGTA CGTGCTATCG ATCTGC CGTT CCTGCTGCCA ATCGAAGACG 350 

45 TATTCTCAAT CTCTGGCCGT GGTACCGTTG TTACCGGTCG TGTTGAGCGC 400 

GGCATCGTGA AAGTGGGCGA CGAAGTAGAA ATCGTTGGTA TCAAAGCGAC 450 

TGCCAAGTCT ACCTGTACCG GTGTTGAAAT GTTCCGCAAA CTGCTGGACC 50 0 

AGGGTCAGGC AGGCGAGAAC TGTGGTGTTC TGCTGCGCGG TATCAAGCGT 550 

GAAGAGATCC AGCGTGGTCA GGTTCTGGCT AAGCCAGGCA CCATCAAGCC 600 

50 ACACACCAAG TTCGTATCAG AAGTGTACGT ACTGTCTAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCACAGTT CTACTTCCGT 700 

ACYACTGATG TGACCGGCAM CATMGAACTG CCAGAAGGCG TTGAGATGGT 750 

AATGCCAGGC GACAACATCA AAATGRCCGT TGAGCTGATC CACCCAATCG 800 

CGATGGACCA GGGTCTGCGT TTCGC 82 5 

55 



2) INFORMATION FOR SEQ ID NO: 136 
60 (i) SEQUENCE CHARACTERISTICS: 

82 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
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(A) LENGTH: 762 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA. 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida 
10 (B) STRAIN: NCTC 10322 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 

CACAAACACG TGAGCACATC CTTTTAGGTC GCCAAGTAGG CGTTCCTTAC 50 

15 ATCATCGTAT TCTTAAACAA ATGCGACATG GTGGATGATG AAGAATTATT 100 

AGAATTAGTT GAAATGGAAG TGCGTGAACT TCTTTCTCAA TATGATTTCC 150 

CAGGTGATGA TACACCAATC GTACGTGGTT CAGCGTTACA AGCGTTAAAC 200 

GGYGTAGCTG AGTGGGAAGA GAAAATTCTT GAGTTAGCCA ACCACTTAGA 250 

TACTTACATT CCAGAGCCAC AACGTGCAAT CGACCAACCG TTCCTTCTTC 3 00 

20 CGATTGAAGA CGTGTTCTCA ATTTCTGGTC GTGGTACAGT AGTAACAGGT 3 50 

CGTGTTGAGC GTGGTATCAT CCGTACAGGT GAAGAGGTTG AAATTGTTGG ■ 400 

TATTAAAGCG ACAACGAAGA CCACAGTAAC AGGTGTTGAG ATGTTCCGTA 4 50 

AATTATTAGA CGAAGGTCGT GCGGGTGAGA ACGTTGGTGC TTTATTACGT 500 

GGTACTAARC GTGAAGAAAT CGAACGTGGT CAAGTGTTAG CGAAACCGGG 550 

25 TTCAATYACG CCACACACTG ATTTTGAATC AGAAGTTTAC GTGTTATCAA 600 

AAGAAGAAGG TGGTCGTCAT ACACCATTCT TCAAAGGTTA CCGTCCACAG 650 

TTCTACTTCC GTACAACGGA CGTAACAGGT ACAATCGAAT TACCGGAAGG 700 

TGTTGAGATG GTGATGCCTG GTGATAACAT CAAGATGACT GTAAGTTTGA 750 

TTCACCCAAT CG 762 

30 



2) INFORMATION FOR SEQ ID NO: 137 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 
45 (B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 

TGGAGCTATC TTAGTTGTAT CAGCAGCGGA TGGACCAATG CCACAAACAA 50 

50 GAGAACACAT CTTATTATCA AGACAAGTAG GAGTACCATA TATCGTAGTA 100 

TATTTGAATA AAGCAGATAT GGTAGAAGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTAAGAGAAT TACTATCTGA ATATGGATTC CCAGGAGATG 200 

AAATTCCAAT CATAACAGGA TCATCCTTAG GAGCATTAAA TGGAGAACAA 250 

AAATGGATAG ATCAAATCAT GGCATTGATG AAAGCCGTAG ATGAATATAT 3 00 

55 TCCAACACCG GAAAGAGCAG TAGATCAACC ATTCTTGATG CCAATCGAAG 350 

ACGTATTTAC AATTACAGGA AGAGGAACTG TAGTAACAGG AAGAGTTGAA 4 00 

AGAGGAGTTG TAAAAGTWGG AGAAGAAGTT GAAATCGTAG GAATCAAAGC 450 

GACAACAAAG ACAACTTGTA CYGGAGTAGA AATGTTCCGA AAATTATTGG 500 

ATCAAGGACA AGCAGGAGAT AACATCGGAG CTTTATTRAG AGGAACCAAG 550 

60 AAAGAAGATG TAGAAAGAGG ACAAGTATTG GCAAAACCAG GAACAATTCA 600 

83 
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TCCTCATACA AACTTCAGTG 
GAGGAAGACA TACTCCATTC 
AGAACCACAG ATATTACAGG 
GGTAATGCCR GGAGATAATA 
5 TTGCAATGGA AACAGGATTA 



GAGAAGTATA TGTATTGACA 
TTCTCAGGAT ACAGACCACA 
AGCAGTAACA TTACCAGAAG 
TCACAATGAC AGTAGAATTG 
CGATTTGCAA TT 



AAAGAAGAAG 650 
ATTTTACTTT 700 
GAGTAGAAAT 750 
ATTCACCCAA 800 
832 



10 



15 



2 ) INFORMATION FOR SEQ ID NO: 138 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus asaccharolyticus 

(B) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 



25 TAGTATGTTC 
CTACTAGCAA 
AGAAGACCAA 
TCAGAGACCT 
GTAGTAGGAT 

30 AGACAAAATC 
CAGTAAGAGA 
TCAATYACAG 
AGTAAAAGTA 
GACAAGTAGT 

35 GCAGAAGCMG 
AGAAATCCAA 
ACACAAAATT 
AGACACACAC 
AACAGACGTA 

40 TGCCAGGAGA 
ATGGACGAAG 



AGCAGCAGAY 
GACAAGTTGG 
GTAGACGATC 
ACTATCAGAA 
CAGCATTAAA 
GTAAAACTAA 
TACAGAACAC 
GAAGAGGAAC 
GGMGACACAG 
AGTAACAGGT 
GAGACAACAT 
AGAGGACAAG 
TGAAGCAGAA 
CATTCTTTAA 
ACAGGAGACA 
CAACTCAACA 
GACTAAGATT 



GGTCCAATGC 
TGTAC CAAAG 
CAGAACTAAT 
TATGACTTCG 
AGC CCTAGAC 
TGGAAGMAGT 
CCATTCCTAA 
AGTAGCAACA 
TAGAACTAGT 
GTAGAAATGT 
TGGAGCCCTA 
TACTAGCAGC 
GTATACGTAC 
CGGATACAGA 
TCCAACTAGC 
TTTACAGTAA 
CGC 



CACAAACAAG 
ATAGTAGTAT 
TGAATTAGTA 
AYGGAGACAA 
GATCCAGACG 
AGACGAATAC 
TGCCAATCGA 
GGAAGAGTAG 
AGGCTTAACA 
TTAGAAAACA 
CTAAGAGGAG 
ACCAGGAACA 
TAACAAAAGA 
CCACAATTCT ' 
AGACGGAGTA 
CACTAATCAC 



AGAACACATT 50 

TCCTAAACAA 100 

GAGATGGAAA 150 

CACACCAATC 200 

GAGAATGGGG 250 

ATCCCAACAC 300 

AGACRTATTC 350 

AACAAGGTGT 400 

GACGAAAGCA 450 

ACTAGACCTA 500 

TACAAAGAGA 550 

ATCAAACCAC 600 

AGAAGGTGGA 650 

ACTTCAGAAC 700 

GAAATGGTAA 750 

ACCAATCGCA 800 
823 



45 2) INFORMATION FOR SEQ ID NO: 139 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 

84 
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CTATCATCGT 
CACATCCTAC 
AAACAAAGAA 
TGGAAATCAG 
5 CCAGTAGTAG 
CCCATGGTCA 
TCGACATCCC 
GACGTAATGA 
AAGAGGAACA 

10 AAGATACAAA 
CTAGACCAAG 
AACAAGAGAT 
TAAACCCACA 
GAAGGTGGAC 

15 CTTTAGAACA 
AAATGGTAAT 
CCAATCGCTC 



AGTATCTGCA 
TAGCAAGACA 
GACCAAGTAG 
AGACCTACTT 
TAGGATCTGC 
GACAAAATCC 
AGAAAGAGAC 
CAATCTCAGG 
CTAAAAGTTG 
AGAAACAGTA 
CAGAATCTGG 
CAAATCTCAA 
CACAGAATTC 
GTCACACACC 
ACAGACGTAA 
GCCAGGAGAC 
TAGAAGAAGG 



GCAGACGGTC 
AGTAGGCGTT 
ACGATCCAGA 
TCAGAATACG 
TCTTAAATCA 
TTGACCTAAT 
AACGACCAAC 
ACGTGGAACA 
GTGATACAGT 
GTAACTGGAG 
AGATAACGTA 
GAGGACAAGT 
GAAGGTCAAG 
ATTCTTCAGT 
CAGGAGACAT 
AAC GCAAC AT 
ACTAAGATTC 



CAATGCCCAA 
CCAAAAATCG 
ACTAATCGAA 
ACTTCGATGG 
CTAGAAGAAG 
GGCACAAGTA 
CATTCCTAAT 
GTAGCAACAG 
AGAAATCGTA 
TAGAAATGTT 
GGACTACTAC 
ACTAGCAAAA 
TATACGTACT 
GGATATAGAC 
CGAACTAGAA 
TCAAAATCAC 
GC 



ACAGAGAGAA 50 

CAGTATTCCT 100 

TTAGTAGAAA 150 

AGACAACGCT 200 

GCGGAGAAGG 250 

GACGAATACT 300 

GCCAGTAGAA 350 

GAAGAGTTGA 400 

GGACTAACAG 450 

CCACAAATCM 500 

TAAGAGGAGT 550 

CCAGGWTCAG 600 

AACAAAAGAA 650 

CACAATTCTT 700 

GAAGGCGTAG 750 

ACTCCAAAAA 800 

832 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 140 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 831 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphvromonas asaccharalytica 

(B) STRAIN: ATCC 25260 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 

CGGTGCTATC ATCGTAGTTG CTGCAACTGA TGGTCCTATG CCTCAGACGC 50 

GTGAGCACAT CCTACTAGCA CGTCAGGTCA ACGTACCTCG TCTAGTTGTC 100 

TTTATGAACA AGTGCGACCT TGTTGATGAC GAGGAGATGC TCGAGCTCGT 150 

40 AGAGATGGAT ATGCGTGAGC TACTAAGCTT CTATGACTTT GACGGCGACA 200 

ACACTCCTGT CATCCGTGGT TCTGCTCTTG GTGCTCTCAA TGGTGAGCCT 250 

aagtgggtag" agaag'gYtat ^gag"ctcatg~ ga^ctgtag acac'tt^at 30V ~ 

CCCACTACCT GAGCGCGACA TCGACAAGCC TTTCCTAATG CCTGTAGAGG 350 

ACGTATTCTC TATCACAGGT CGTGGTACTG TCGCTACTGG TCGTATCGAG 40 0 

45 ACTGGTGTCG TTAAGGTCAA CGATGAGGTT CAGATCATCG GTCTAGGTGC 450 

TGAGGGTAAG AAGAGCGTCG TAACTGGCGT GGAAATGTTC CGCAAGATCC 500 

TTGATGAGGG TGAAGCTGGT GATAACGTAG GTCTCCTACT CCGTGGTATC 550 

GACAAGGACG AGATCAAGCG CGGTATGGTC CTAGCACACC CAGGTCAGGT 600 

CAAGCCTCAC GATCACTTCA AGGCTGAGGT CTATATCCTG AAGAAGGAAG 650 

50 AGGGTGGTCG TCACACACCA TTC CACAAC A AGTACCGTCC TCAGTTCTAC 700 

ATCCGTACGC TAGACGTAAC GGGCGAGATC ACACTCCCAG AGGGTGTAGA 750 

GATGGTTATG CCTGGTGATA ACGTCAC CAT CGATGTCAAG CTCATCTCTC 800 

CAGTAGCTTG TAGCGTAGGT CTACGCTTCG C 831 



55 



2) INFORMATION FOR SEQ ID NO: 141 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 818 bases 

85 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonas gingival is 

(B) STRAIN: ATCC 33277 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 

CGGTGCTATA ATCGTTGTAG CAGCTACAGA CGGTCCTATG CCTCAGACTC 50 

GCGAGCACAT CCTTTTGGCT CGCCAGGTAA ACGTTCCTCG TCTGGTTGTT 100 

15 TTCATGAACA AATGTGACAT GGTAGACGAT GAAGAGATGC TCGAGCTTGT 150 

TGAAATGGAC ATGCGCGAAC TCCTTTCTTT CTACGATTTC GATGGTGACA 2 00 

ATACCCCTAT CATCCGTGGT TCTGCTCTGG GCGCTTTGAA TGGAGAGCCT 2 50 

CAGTGGGAAG ACAAGGTGAT GGAGCTTATG GAAGCTGTTG ACAACTGGGT 300 

TCCCCTGCCT GAGCGCGATA TCGACAAACC GTTCTTGATG CCGGTTGAAG 350 

2 0 ACGTGTTCTC TATCACGGGT CGTGGTACGG TCGCTACAGG ACGTATCGAA 4 00 

ACCGGTATTG TGAAGACCGG TGACGAAGTT CAAATCATCG GCCTCGGTGC 4 50 

AGAAGGAATG AAGTCGGTTG TTACGGGTGT TGAAATGTTC CGTAAGATTC 500 

TTGACGAAGG TCAGGCTGGT GACAACGTTG GTCTCCTCCT GCGTGGTATC 550 

GATAAGGATC AGAT CAAGCG TGGTATGGTT ATCTCTCACC CGGGTAAGAT 6 00 

25 TACTCCTCAC AAGAGATTTA AGGCCGAGGT TTATATCTTG AAGAAAGAAG 650 

AAGGTGGTCG CCACACTCCT TTC CACAACA AATATCGTCC GCAGTTCTAC 700 

ATCCGTACGC TTGACGTGAC CGGTGAAATC ACTCTTCCCG AAGGAACAGA 750 

AATGGTTATG CCCGGTGACA ACGTAACGAT CACTGTAGAA CTCATCTACC 8 00 

CGGTTGCATG TAATGTAG 818 

30 



2) INFORMATION FOR SEQ ID NO: 142 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

40 

(ii) MOLECULE TYPE; Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 
45 (B) STRAIN: ATCC 49100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 

CGGCGCTATT CTGGTTGTTG CTGCAACTGA TGGTCCTATG CCTCAAACTC 50 

50 GTGAGCACAT CCTGTTAGGY CGCCAGGTTG GCGTACCATA CAT CATTGTG 100 

TTCCTGAACA AGTGTGACAT GGTTGAYGAT GAAGAGCTGT TAGAACTGGT 150 

TGAAATGGAA GTTCGTGAGC TTCTGTCTCA GTACGATTTC CCAGGTGATG 2 00 

ATACTCCAGT TGTTCGTGGT TCTGCGCTGA AAGCGTTRGA AGGCGAAGCT 250 

GAGTGGGAAG CTAAAATCAT TGAATTGGCT GACTCCCTGG AYAGCTACAT 3 00 

55 TCCACAGCCA GAGCGTGCAA TTGATAAGCC GTTCCTGCTG CCAATCGAAG 3 50 

ACGTTTTCTC AATCTCTGGC CGTGGTACAG TAGTAACCGG TCGTGTAGAG 4 00 

CGCGGTATCG TTAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

TACTGTGAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AARTTACTGG 500 

ATGAAGGCCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTACTAAG 550 

60 CGTGATGAAA TCGAACGTGG TCAAGTATTA GCAAAACCAG GTTCAATCAA 600 

86 
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CCCGCATACT AACTTCGTAT CAGAAGTTTA 
GTGGTCGTCA TACTCCATTC TTCAAAGGCT 
CGTACAACTG ACGTGACCGG TACCATCGAA 
GGTAATGCCA GGTGATAACA TTCAGATGAC 
5 TCGCGATGGA CGAAGGTTTA CGCTTCGCTA 



TATCCTGAGC AAAGATGAAG 650 
ACCGTCCACA GTTYTACTTC 700 
CTGCCAGAAG GCGTAGAGAT 750 
TGTAACTCTG ATTGCCCCAA 8 00 

830 



2 ) INFORMATION FOR SEQ ID NO: 143 



10 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Prevotella melaninogenica. 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 

25 TGGTGCTATC TTGGTTGTAG CTGCTACTGA TGGTCCTATG CCTCAGACTC 50 

GTGAGCACGT ATTGCTCGCT CGTCAGGTAA ACGTACCTCG CTTGGTTGTA 100 

TTCTTGAACA AGTGTGATAT GGTTGACGAT GCTGAGATGC TTGACCTCGT 150 

TGAGATGGAG GTTCGTGAGA TCCTCGAGCA GTACGGTTAT GAGGAGGATA 2 00 

CTCCTATTAT TCGTGGTTCT GCACTCGGTG CTTTGAACGG TGTTGAGAAG 250 

30 TGGGTAGACT CTGTAATGGA GCTCATGGAT ACTGTTGACA CTTGGATTGA 3 00 

AGAGCCAGAG CGTGAGATTG ACAAGCCATT CTTGATGCCT GTTGAGGACG 3 50 

TATTCTCTAT CACAGGTCGT GGTACTGTAG CTACTGGTCG TATCGAGACT 400 

GGTATCTGTA AGGTAGGTGA TGAGGTTCAG TTGCTCGGTC TCGGTGAGGA 450 

CAAGAAGTCT GTTATCACTG GTGTTGAGAT GTTCCGTAAG AACCTTCCAA 500 

35 CAGGTCAGGC TGGTGACAAC GTAGGTCTCC TCCTTCGTGG TATCGATAAG 550 

GCTGAGGTTA AGCGTGGTAT GGTTGTTGTG CACCCAGGTG CTATTACTCC 600 

TCACGATCAC TTCAAGGCAT CTATCTATGT ATTGAAGAAG GAAGAGGGTG 650 

GT CGTCAT AC TCCATTCGGT AACAAGTATC GTCCACAGTT CTACCTCCGT 7 00 

ACAATGGACT GTACAGGTGA AATCCACCTC CCAGAGGGCG TTGAGATGGT 750 

40 TATGCCAGGT GACAACGTAG AGATTGAAGT TGTATTGATC TATAAGGTTG 8 00 

CTTTGAACGA GGGTCTTCGT T 821 



45 2) INFORMATION FOR SEQ ID NO: 144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevotella oralis 

(B) STRAIN : ATCC 332 69 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 
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TGGTGCTATT CTTGTAGTAG CTGCTACTGA CGGTCCTATG CCTCAAACTC 50 

GTGAACACGT GCTTCTTGCT CGTCAGGTGA ACGTACCTCG TTTGGTCGTT 100 

TTCTTGAACA AGTGCGATAT GGTTGACGAT GAAGAAATGC TTGAGCTCGT - 150 

AGAAATGGAG CTTCATGAAC TTCTCGAGCA GTATGAATAT GAGGAGGATA 200 

5 CTCCTATTGT TCGTGGTTCG GCACTTGGCG CTCTGAATGG AGTAGAGAAG 250 

TGGGTTGACA GCGTGATGAA GTTGATGGAT AC CGTTGATG AATGGATACA 3 00 

GGAACC AC CG CGTGATCTTG ATAAGCCTTT CTTGATGCCG GTAGAGGATG 3 50 

TATTTTCTAT TACTGGTCGT GGAACGGTTG TTACAGGCCG TATTGAAACT 4 00 

GGTAAGGTTA AGGTGGGCGA TGAAGTTCAA CTTCTTGGTC TCGGTGAAGA 4 50 

10 TAAGAAGTCC GTTGTGACAG GCGTTGAGAT GTTCCGTAAG ATTCTTGACG 500 

AAGGTGAAGC TGGTGATAAT GTAGG CTTG C TGCTTCGTGG TATCGATAAG 550 

ACGGAAGTAA AGCGTGGTAT GGTTGTCGTA CATCCGGGGG CTATTACTCC 600 

TCACGATCAT TTCAAGGCTT CAGTTTACGT ATTGAAGAAA GAAGAAGGCG 650 

GTCGCCATAC TCCGTTTGGT AMCAAGTATC GTCCACAGTT CTATCTTCGT 70 0 

15 ACCATGGACT GTACTGGTGA AATTACTCTT CCGGAAGGAG TTGAGATGGT 750 

AATGCCGGGT GATAACGTCG AAATTGAAGT TAAGTTGATC TATCCGGTAG 8 00 

CTTTGAACGA GGGACTTCGT TTCGCTA 82 7 



20 



2) INFORMATION FOR SEQ ID NO: 14 5 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 83 3 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Propi an iJbac tedium acnes 

(B) STRAIN: ATCC 6919 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 

CGGCGCCATC CTCGTGGTTG CTGCTACCGA CGGCCCGATG CCTCAGACTC 50 

GCGAGCACGT TCTGCTCGCT CGTCAGGTGG GCGTGCCCGC CATCGTCGTC 100 

GCCCTCAACA AGTGCGACAT GGTTGACGAT GAGGAGCTCA TTGAGCTCGT 150 

40 CGAGATGGAG GTCCGCGAGC TGCTGACCTC GCAGGAGT TC GACGGCGACA 2 00 

ACTGCCCTGT CGTTCGCATC TCCGCCTTCC AGGCCCTCCA GGGTGATGAG 2 50 

AAGTC^ACccf agt^atc!:V1:gac"ct^Yg" gacgccgtgg" acgag'i^'at " 3 oV " 

CCCGCAGCCT GAGCGCGATC TCGACAAGCC CTTCCTTATG CCGATCGAGG 350 

ACGTCTTCAC CATCACCGGC CGTGGCACCG TTGTCACCGG TCGTGTCGAG 400 

45 CGCGGCGTCG TCAAGACTGG CGAAGAGGTC GAGATCGTCG GTATCCACGA 450 

GAAGACCCAG AAGACCACCG TTACCGGTGT CGAGATGTTC CGCAAGATCC 5 00 

TCGACGAGGG CCGCGCTGGT GAGAACGTCG GCGTTCTGCT CCGTGGCACC 550 

AAGAAGGAGG ATGTCGTTCG CGGCATGGTC CTCTCCAAGC CTGGTTCCAC 600 

CACCCCCCAC ACCGACTTCG AGGGCCAGGT CTACGTCCTC AAGAAGGATG 650 

50 AGGGTGGCCG CCACAAGCCG TTCTTCTCCC ACTACAGCCC CCAGTTCTAC 7 00 

TTCCGTACCA CGGACGTGAC TGGCACTGTT GAGCTCCCCG AGGGCACCGA 750 

GATGGTCATG C CTGGCGAC A ACACCGACAT GACTGTGCAC CTGATTCACC 800 

CGGTTGCCAT GGAGGATCAG CTCAAGTTCG CTA 833 



55 



2) INFORMATION FOR SEQ ID NO: 14 6 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 74 5 bases 

88 
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10 



15 



20 



25 



30 



35 



40 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirahilis 

(B) STRAIN: ATCC 35659 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 

CACAAACTCG TGAG CACATC CTGTTAGGTC GTCAGGTTGG TGTTCCTTAC 50 

AT CATCGTAT TCCTGAACAA ATGTGACATG GTAGATGATG AAGAGCTGTT 100 

AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAA TACGATTTCC 150 

CAGGTGATGA CACTCCAGTA ATC CGTGGTT CAGCGCTGAA AGCACTGGAA 200 

GGCGAAGCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG AAGCACTGGA 250 

TTCTTATATC CCAGAGCCAG AGCGTGCAAT TGACAAACCA TTCCTGTTAC 3 00 

CAATCGAAGA TGTATTCTCA ATCTCAGGCC GTGGTACAGT AGTTACTGGT 350 

CGTGTAGAGC GTGGTATCAT CAAAGTAGGT GATGAAGTTG AGATTGTTGG 400 

TATCAAAGAA ACCGCCAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA 450 

AATTACTTGA CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT TCTGCTGCGT 500 

GGTACAAAAC GTGAAGAAAT CGAACGTGGA CAAGTACTGG CRAAACCAGG 550 

CTCAATCAAC CCACACAACA AATTTGAATC AGAAGTTTAT ATTCTGAGCA 600 

AAGATGAAGG TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG 650 

TTCTACTTCC GTACAACTGA CGTAACTGGT ACTATCGAAT TAC CAGAAGG 700 

CGTAGAAATG GTAATGCCAG GCGACAACGT GAACATGATC GTTGA 745 



2) INFORMATION FOR SEQ ID NO: 14 7 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Proteus pennen 

(B) STRAIN: ATCC 33519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 



GGAGCTATCC 
TGAGCACATC 
TCCTGAACAA 



TGGTTGTTGC 
CTGTTAGGTC 
ATGTGACATG 
TTCGTGAACT 
ATCCGTGGTT 
AAAAATTGTT 
AGCGTGCAAT 
ATTTCAGGCC 
CAAAGTTGGT 
CAACTTGTAC 
GCAGGTGAGA 
CGAACGTGGA 
AATTTGAATC 
ACTCCATTCT 



TGCGACAGAT 
GTCAGGTTGG 
GTAGATGATG 
TCTGTCTCAG 
CAGCGCTGAA 
GAATTAGCAG 
TGACAAACCA 
GTGGTACAGT 
GAAGAAGTTG 
TGGCGTTGAA 
ACGTAGGTGT 
CAAGTACTGG 
AGAAGTTTAT 
TCAAAGGCTA 



GGCCCAATGC 
TGTTCCTTAC 
AAGAGTTACT 
TACGATTTCC 
AGCACTGGAA 
AAGCACTGGA 
TTCCTGTTAC 
AGTAACAGGT 
AAATCGTTGG 
ATGTTCCGTA 
TCTTCTGCGT 
CGAAACCAGG 
ATTCTGAGCA 
CCGTCCACAG 



CACAAACTCG 
ATCATCGTAT 
GGAATTAGTM 
CAGGTGATGA 
GGCGAAGCAG 
TTCATACATC 
CAATTGAAGA 
CGTGTTGAGC 
TATTAAACCA 
AATTACTTGA 
GGTACTAAAC 
TTCAATCAAC 
AAGATGAAGG 
TTCTACTTCC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 



50 GAAATGGAAG 



CACTCCAGTA 
AGTGGGAAGC 
CCAGARCCAG 
CGTATTCTCA 



55 GTGGCGTAAT 



ACAGCGAAAA 
CGAAGGTCGT 
GTGAAGAAAT 
CCACACACTA 



60 TGGTCGTCAT 
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GTACAACTGA CGTAACTGGT ACTATCGAAT TACCAGAAGG CGTAGAAATG 750 
GTAATGCCAG GTGACAACAT CAACATGATC GTTGAACTGA TTCACCCAAT 800 
CGCGATGGAC GACGGTTTAC GTTTCGCTA 829 



2 ) INFORMATION FOR SEQ ID NO: 14 8 



10 



15 



20 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 82 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 



CGGAGCTATT 
GTGAGCACAT 

25 TTCCTGAACA 
AGAAATGGAA 
ACACTCCAGT 
GAGTGGGAAG 
CCCAGAACCA 

3 0 ACGTATTCTC 
CGTGGTGTTG 
CACAGTTAAA 
ACGAAGGTCG 
CGTGAAGAAA 

35 GCCACACACT 
GTGGTCGTCA 
CGTACAACTG 
GGTAATGCCA 
TCGCGTAGGA 

40 



CTGGTTGTTG 
CCTGTTAGGT 
AATGTGACAT 
GTTCGTGAAC 
AATCCGTGGT 
CAAAAATTGT 
GAGCGTGCAA 
AATCTCTGGT 
TTAAAGTTGG 
ACAACTTGTA 
TGCAGGTGAG 
TCGAACGTGG 
AAATTCGAAT 
CACTCCATTC 
ACGTAACTGG 
GGTGACAACA 
CGACGGTTTA 



CTGCGACTGA 
CGCCAGGTTG 
GGTTGATGAT 
TTC TGTCTCA 
TCAGCGCTGA 
TGAATTAGCA 
TTGACAAACC 
CGTGGTACAG 
TGAAGAAGTT 
CTGGCGTTGA 
AACGTAGGTG 
ACAAGTACTG 
CAGAAGTTTA 
TTCAAAGGTT 
TACTATCGAA 
TCAACATGAT 
CGTT 



TGGCCCAATG 
GTGTACCTTA 
GAAGAACTGC 
GTACGATTTC 
AAGCACTGGA 
GAAGCACTGG 
ATTCCTGCTG 
TAGTAACAGG 
GAGATTGTTG 
AATGTT CCGT 
TTCTTCTGCG 
GCTAAACCAG 
TATCCTGAGC 
ACCGTCCACA 
TTACCAGAAG 
CGTTGAACTG 



CCACAAACTC 50 

CATCATCGTA 100 

TGGAATTAGT 150 

CCAGGTGATG 200 

AGGCGAAGCT 250" 

ATTCTTACAT 3 00 

CCTATCGAAG 350 

CCGTGTAGAG 40 0 

GTATTAAAGA 450 

AAATTACTTG 500 

TGGTACTAAA 550 

GTTCAATCAA 600 

AAAGATGAAG 650 

GTTCTACTTC 70 0 

GCGTAGAAAT 750 

ATTCACCCTA 800 
824 



2) INFORMATION FOR SEQ ID NO: 149 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia alcalifaciens 
55 (B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 

CACAAACTCG TGAGCACATC CTGTTAGGTC GCCAAGTAGG TGTTCCTTAC 50 
60 ATCATCGTTT TCCTGAACAA ATGTGACATG GTAGACGACG AAGAACTGTT 100 

90 
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AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC 150 

CAGGCGATGA CACTCCAGTT GTTCGCGGTT CAGCACTGAA AGCGCTGGAA 2 00 

GGCAACCCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG GTTACCTGGA 250 

TTCTTACATC CCAGAACCAG AGCGTGCAAT TGACAAGCCA TTCCTGCTGC 3 00 

5 CAATCGAAGA CGTATTCTCA ATCTCTGGTC GTGGTACAGT AGTAACAGGC 350 

CGTGTTGAGC GTGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG 4 00 

TATTCAAGCG ACTGCGAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA 450 

AACTGCTGGA TGAAGGTCGT GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT 500 

GGTACTAAAC GTGAAGAAAT TCAACGTGGT CAAGTACTGG CTAAACCAGG 550 

10 TTCAATCAAG CCACACACTC AATTCGAATC AGAAGTATAT ATTCTGAGCA 600 

AAGATGAAGG TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG 650 

TTCTACTTCC GTACAACTGA CGTAACCGGT ACTATCGAAC TGCCAGAAGG 700 

CGTAGAGATG GTAATGCCAG GCGACAACAT CAACATGATC GTGAC 745 



15 



2) INFORMATION FOR SEQ ID NO: 150 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia. rettgeri 

(B) STRAIN: ATCC 9250 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 

CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGY CGCCAAGTAG GTGTWCCTTA CATCATCGTT 100 

35 TTCCTGAACA AATGTGACAT GGTAGACGAC GAAGAACTGT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATACGATTTC CCAGG CGACG 200 

ACACTCCAGT TGTCCGTGGT TCAGCTCTGA AAGCGCTGGA AGGCAACCCA 250 

GAGTGGGAAG CGAAAATTGT TGAATTAGCA GGTCACTTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGCTG CCAATCGAAG 350 

40 ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG CCGTGTTGAG 400 

CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 450 

cacggYtaaa "acaacttgta - ct^cgttga" aa t c^tcc g t~ ^^ct gc tgg" ~ 5 00 ** 

ACGAAGGTCG TGC<3GGTGAG AACGTTGGTG TTCTGCTGCG TGGTACTAAA 550 

CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCAAAACCAG GTTCAATCAA 600 

45 GCCACACACT AAATTCGAAT CAGAAGTCTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA CACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGTGATAACA TCAACATGAT CGTTACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTTTA CGTTTCGCAA 830 



2) INFORMATION FOR SEQ ID NO: 151 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

91 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 
5 (B) STRAIN : ATCC 33673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 

CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

10 GTGAGCACAT CCTGTTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTT 100 

TTCCTGAACA AATGTGACAT GGTTGACGAC GAAGAACTGT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 2 00 

ACACTCCAGT TGTTCGYGGT TCAGCACTGA AAGCGCTGGA AGGTATCCCT 250 

GAGTGGGAAG CGAAAATTGT TGAATTAGCC GGTTACCTGG ACAGCTACAT 3 00 

15 CCCAGAACCA GAGCGCGCAA TTGACCGTCC ATTCCTGCTG CCAATCGAAG 3 50 

ACGTATTCTC AAT CTCTGGT CGTGGTACAG TAGTAACAGG SCGTGTTGAG 4 00 

CGTGGTATCG TTAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 450 

CACRGTTAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AAACTGCTTG 500 

ACGAAGGTCG TGCTGGTGAG AACGTTGGTG TTTTACTGCG TGGTACTAAG 550 

2 0 CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACT ACTTTTGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 70 0 

CGTACAACTG ACGTAACCGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACA TCAACATGAT CGTGACACTG ATTCACCCAA 8 00 

25 TCGCGATGGA TGATGGTTTA CGTTTC 82 6 



2) INFORMATION FOR SEQ ID NO: 152 



30 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Providencia stuartii 

(B) STRAIN: ATCC 33672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 

45 CGGTGCAATC CTAGTTGTTG CGGCAACAGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGTGACAT GGTAGACGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATACGATTTC CCAGGTGATG 200 

ACACTCCAGT TATCCGTGGT TCAGCGCTGA AAGCGTTGGA AGGCAACCCA 250 

50 GAGTGGGAAG CGAAAATCGT TGAACTAGCA GAAGCACTGG ACAGCTACAT 3 00 

CCCAGAGCCA GAGCGTGCAA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 3 50 

ACGTATTCTC AATCTCAGGT CGTGGTACAG TAGTCACAGG CCGTGTTGAG 400 

CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTAG GTATCAAAGA 4 50 

GACTGCGAAA ACCACTTGTA CTGGCGTTGA AATGTTCCGT AAACTGCTGG 500 

55 ACGAAGGCCG TGCGGGTGAG AACGTAGGTG TTCTGCTGCG TGGTACTAAG 550 

CGTGAAGAAA TCGAACGTGG TCAAGTTCTG GCGAAACCAG GTTCAATCAA 600 

GCCACACACA ACTTTCGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA CACGCCATTC TTCAAAGGYT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

60 GGTAATGCCA GGCGACAACG TGAACATGAA AGTAACTCTG ATTCACCCAA 8 00 
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TCGCGATGGA CGATGGTTTG CGTTTCGCAA 



5 2) INFORMATION FOR SEQ ID NO: 153 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa. 

(B) STRAIN: ATCC 3 5554 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 



CCTGGTTTGC TCGGCTGCCG ACGGCCCCAT GCCGCAGACC CGCGAGCACA 50 

TCCTGCTGTC CCGCCAGGTA GGCGTTCCCT ACATCGTCGT GTTCCTGAAC 100 

AAAGCCGACA TGGTCGACGA CGCCGAGCTG CTGGAACTGG TCGAGATGGA 150 

AGTTCGCGAT CTGCTGAACA CCTACGACTT CCCGGGCGAC GACACTCCGA 200 

25 TCATCATCGG TTCCGCGCTG ATGGCGCTGG AAGGCAAGGA TGACAACGGC 25 0 

ATCGGCGTAA GCGC CGTGCA GAAGCTGGTA GAGACC CTGG ACTCCTACAT 300 

TCCGGAGCCG GTTCGTGCCA TCGACCAGCC GTTCCTCATG CCGATCGAAG . 350 

ACGTGTTCTC GATCTCCGGT CGCGGTACCG TGGTAACCGG TCGTGTAGAG 400 

CGCGGCATCA TCAAGGTCCA GGAAGAAGTG GAAATCGTCG GCATCAAGGC 450 

30 GACC AC CAAG ACCACCTGCA CCGGCGTTGA AATGTTCCGC AAGCTGCTCG 500 

ACGAAGGTCG TGCTGGTGAG AACGTTGGTA TCCTGCTGCG TGGCACCAAG 550 

CGTGAAGACG TAGAGCGTGG CCAGGTACTG GCCAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAGT GCGAAGTGTA CGTGCTGTCC AAGGAAGAAG 650 

GTGGTCGTCA CACCCCGTTC TTCAAGGGCT ACCGTCCGCA GTTCTACTTC 70 0 

35 CGTACCACKG ACGTGACCGG TAMCTGCGAG CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAGATGGT TGTCACCCTG ATCGCTCCGA 800 

TCGCCATGGA AGATGGCTGC GTTCGCG 827 



40 



50 



2) INFORMATION FOR SEQ ID NO: 154 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 841 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas fluoresceins 

(B) STRAIN: ATCC 13525 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 

CGGCGCAATC CTGGTTTGCT CGGCCGCTGA TGGTCCGATG CCACAAACCC 50 

GTGAACACAT CCTGCTGTCC CGTCAGGTTG GCGTTCCGTA CATCGTGGTT 100 

TACCTGAACA AGGCTGACCT GGTAGACGAC GCTGAGCTGC TGGAACTGGT 150 

60 TGAGATGGAA GTGCGCGATC TGCTGAGCAC TTACGACTTC CCAGGCGACG 200 

93 
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ACACTCCGAT CATCATCGGT TCTGCTCGTA TGGCTCTGGA AGGCAAAGAC 250 

GACAACGAAA TGGGCACCAC GTCCGTTCGT AAACTGGTTG AAACTCTGGA 3 00 

CAGCTACATC CCAGATCCAG TTCGTGTTAT CGACAAGCCG TTCCTGATGC 3 50 

CAATCGAAGA CGTGTTCTCG ATCTCCGGTC GCGGTACTGT TGTGACTGGT 4 00 

5 CGTATCGAGC GCGGTATCGT TAAGGTTCAA GATCCACTGG AAATCGTTGG 4 50 

TCTGCGTGAC ACTACCGTCA CCACCTGCAC CGGTGTTGAA ATGTTCCGTA 5 00 

AGCTGCTCGA CGAAGGTCGT GCTGGCGAGA ACTGCGGCGT TCTGCTGCGT 550 

GGTACCAAGC GTGACGACGT TGAGCGTGGC CAGGTTCTGG TTAAGCCAGG 600 

TTCGGTTAAG CCGCACACCA AGTTCGAAGC TGAAGTCTAC GTACTGAGCA 650 

10 AAGAAGAAGG CGGTCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCACAG 700 

TTCTACTTCC GTACTACTGA CGTGACTGGT AACTGCGAGC TGCCGGAAGG 750 

CGTTGAAATG GTTATGCCAG GCGACAACAT CAAAATGGTT GTTACCCTGA 8 00 

TCAAAACCAT CGCAATGGAA GACGGTCTGC GTTTCGCTAT T 841 



15 



2 ) INFORMATION FOR SEQ ID NO: 155 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 841 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas stutzeri 

(B) STRAIN: ATCC 17588 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 

CGGCGCGATC CTGGTCTGCT CGGCTGCTGA CGGCCCCATG CCGCAGACTC 50 
GCGAGCACAT CCTGCTGTCC CGTCAGGTTG GTGTTCCGTA CATCGTCGTG 100 
TTCCTGAACA AGGC CGACAT GGTTGATGAC GCCGAGCTGC TCGAGCTGGT 150 
35 CGAGATGGAA GTTCGYGACC TGCTGTCGAC CTACGACTTC CCGGGTGAYG 2 00 

ACACTCCGAT CATCATCGGC TCCGCGCTGA TGGCGCTGAA CGGCGAAGAC 2 50 

GACAACGAGC TCGGCACCAC TGCGGTGAAG AAGCTGGTCG AGACCCTGGA 3 00 

CAGCTACATT CCCGAGCCGG TTCGTGCCAT CGACAAGCCG TTCCTGATGC 350 
CGATCGAAGA CGTGTTCTCG ATCTCCGGTC GCGGCACSGT GGTAACCGGT 400 
40 CGCGTAGAGC GCGGCATCGT CAAGGTTCAG GAAGAGATCG AGATCGTCGG 4 50 

TCTGCGTCCG ACCACCAAGA CTACCTGCAC CGGCGTTGAG ATGTTCCGCA 500 

AGCTGCTCG/T Y(j]&(3&riJ&l r l3(Jl\^ bbu" 

GGCACCAAGC GTGACGAAGT GGAGCGTGGT CAGGTTCTGG CCAAGCCGGG 600 

CACCATCAAG CCGCACACCA AGTTCGAAGC CGAAGTGTAC GTGCTGTCCA 650 

45 AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT TYAAGGGCTA CCGTCCKCAG 700 

TTCTACTTCC GTACCACTGA YGTGACYGGW TCGTGCGARC TGCCGGAAGG 750 

CGTCGAGATG GTAATGCCGG GCGACAACGT GAAGATGGTT GTCACCCTGA 800 

TCAAGC CG AT CGCCATGGAA GACGGCCTGC GCTTCGCGAT T 841 

50 

2) INFORMATION FOR SEQ ID NO: 156 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

94 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 

(B) STRAIN: ATCC 23333 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 

GCTATTCTAG TAGTATCAGC AACTGACGGT CCAATGCCAC AAACACGTGA 50 

GCACATTCTA TTATCACGTC AGGTTGGTGT ACCATACATC ATCGTATTCA 100 

10 TGAACAAATG TGACATGGTA GATGACGAAG AGTTACTAGA GCTAGTAGAA 150 

ATGGAAGTGC GTGAATTACT TTCAGACTAC GACTTCCCAG GTGATGACAC 200 

TCCAATCATC AAAGGTTCAG CTTTAGAAGC GTTAAATGGT AACGACGGTA 250 

AGTACGGTGA GCCAGCAGTT ATCGAACTAC TAAACACTCT AGACACTTAC 300 

ATTCCAGAGC CAGAGCGTGA CATCGATAAG CCATTCCTAA TGCCAATCGA 350 

15 AGACGTATTC TCAATCTCAG GTCGTGGTAC AGTAGTAACA GGCCGTGTTG 400 

AATCTGGTAT CATCAAAGTT GGTGACGAAA TCGAAATCGT TGGTATCAAA 450 

GACACAGTTA AAACAACTTG TACTGGTATC GAGATGTTCC GTAAGTTACT 500 

AGACGAAGGT CGTGCTGGTG AGAACTGTGG TGTACTATTA CGTGGTACTA 550 

AGCGTGAAGA CGTACAACGT GGTCAAGTAC TTGCTAAGCC AGGTTCAATC 600 

20 ACTCCACACA CCAACTTCGA CGCAGAAGTA TACGTACTAT CAAAAGAAGA 650 

AGGTGGTCGT CACACTCCAT TCTTAAATGG TTACCGTCCA CAGTTCTACT 700 

TCCGTACTAC TGACGTAACA GGTGCAATCA CGTTACAAGA AGGTACTGAA 750 

ATGGTAATGC CAGGCGATAA CGTTGAGATG AGCGTAGAGC TAATCCACCC 800 

AATCGCTAGG ACAAAGGTTT ACGTTTCGCA ATC 833 

25 



2) INFORMATION FOR SEQ ID NO: 157 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Rahnella aquatilis 
40. (B) STRAIN: ATCC 33071 

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACTCG 50 

45 TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCATAC ATGAT CGTGT 100 

TCATGAACAA ATGCGACATG GTAGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAACT TCTGTCTGCT TACGAATTCC CAGGCGACGA 2 00 

CATCCCGGTC ATCAAAGGTT CAGCGCTGAA AGCACTGGAA GGCGATGCTA 250 

CTTGGGAAGC GAAAATCATC GAACTGGCAG AAGCACTGGA CAGCTACATT 3 00 

50 CCATTGCCAG AGCGTGCTAT CGATAAGCCA TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACAGT GGTTACCGGT CGTGTAGAGC 4 00 

GCGGTATCGT TAAAGTGGGC GAAGAAGTTG AAATCGTCGG TATCAAGGAC 450 

ACTGTTAAGT CTACTTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAGC 550 

55 GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 

CCACACACCA AGTTTGATTC CGAAGTGTAC ATCCTGAGCA AAGATGAAGG 650 

TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTTATGCCTG GTGACAACGT GAACATGGTT GTTACCCTGA TCCACCCAAT 800 

60 CGCGATGGAT GACGGTCTGC GTTTC 825 

95 
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2 ) INFORMATION FOR SEQ ID NO: 158 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Salmonella choleraesuis subsp. arizonae 

(B) STRAIN: ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

20 CGGCGCGA.TC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 5 0 

GTGAGCACAT CCTGCTGGGC CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ATACGC CGAT CGTT CGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

25 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ACTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCC TGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 50 0 

30 ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTGGAGAT 750 

35 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 83 0 



40 2) INFORMATION FOR SEQ ID NO : 159 



{ X ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



50 (vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 

(B) STRAIN: ATCC 7001 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGYCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

60 TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

96 
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ACACGCCGAT 
GAGTGGGAAG 
TCCGGAACCA 
ACGTATTCTC 
CGCGGTATCA 
GACTCAGAAG 
ACGAAGGCCG 
CGTGAAGAAA 
GCCGCACACC 
GCGGCCGTCA 
CGTACTACTG 
GGTAATGCCG 
TCGCAATGGA 



CGTTCGTGGT 
CGAAAATCAT 
GAGCGTGCGA 
CATCTCCGGT 
TCAAAGTGGG 
TCTACCTGTA 
TGC CGGTGAG 
TCGAACGTGG 
AAGTTCGAAT 
TACT C CGTTC 
ACGTGACTGG 
GGCGACAACA 
CGACGGTCTG 



TCTGCTCTGA 
CGAACTGGCT 
TTGACAAGCC 
CGTGGTACCG 
YGAAGAAGTT 
CTGGCGTTGA 
AACGTAGGTG 
TCAGGTACTG 
CTGAAGTGTA 
TTCAAAGGCT 
CACCATCGAA 
TCAAAATGGT 
CGTTTCGCAA 



AAGCGCTGGA 
GGCTTCCTGG 
GTTCCTGCTG 
TTGTTACCGG 
GAAATCGTTG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAGCCGG 
CATTCTGTCC 
ACCGTCCGCA 
CTGCCGGAAG 
TGTTACCCTG 
TC 



AGGTGACGCA 
ATTCTTACAT 
CCGATCGAAG 
TCGTGTAGAG 
GTATCAAAGA 
AAACTGTTGG 
TGGTATCAAA 
GCACCATCAA 
AAAGATGAAG 
GTTCTACTTC 
GCGTAGAGAT 
ATCCACCCGA 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
832 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 160 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 807 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. diarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATTATCGT GTTC CTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCAA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 3 50 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 4 00 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 4 50 

GTCTAC CTGT ACTGGCGTTG AAATGTTCCG CAARCTGCTG GACGAAGGCC 500 

GTGCWGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ATATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 700 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACGG 8 07 



2) INFORMATION FOR SEQ ID NO: 161 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
5 serotype Heidelberg 

(B) STRAIN: ATCC 8326 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 

10 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

15 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

20 ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 6 00 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA YACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA TTGCCGGAAG GCGTAGAGAT 750 

25 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



30 2) INFORMATION FOR SEQ ID NO: 162 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 
35 (C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43 974 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGATTT CCCGGGCGAC GACACGCCGA 2 00 

50 TCGTGCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAATGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCTACCTG GATTCTTACA TCCCGGAACC 300 

AGAGCGTGCG ATTGACAAAC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CTATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

55 GTCGACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTGCTGGCGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AATTCTACTT CCGTACGACT 700 

60 GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

98 
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GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 
ACGACGG 8 07 



2) INFORMATION FOR SEQ ID NO: 163 

( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 82 7 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuls subsp . indica 

(B) STRAIN: ATCC 43976 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

25 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGTGACGACA 2 00 

CGCCGATCGT GCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATTCC 3 00 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCCGGTCGT GGTACYGTTG TTACCGGTCG TGTAGAGCGC 4 00 

3 0 GGTATCATCA AAGTGGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 4 50 

TCAGAAGTCT ACCTGTACTG GCGTTGAAAT GTTC CGCAAA CTGCTGGACG 5 00 

AAGGCCGTGC CGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCA CCATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GATGAAGGCG 650 

35 GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGCG TAGAGATGGT 7 50 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CATCCGATCG 800 

CRATGGACGA CGGTCTGCGT TTCGCAA 827 



40 



55 



2) INFORMATION FOR SEQ ID NO: 164 



( i ) SEQUENCE CHARACTERI STI CS : 
45 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 
TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 
60 AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

99 
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AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCGA 2 00 

TCGTGCGTGG TTCCGCTCTG AAAGCGCTGG AAGGCGAMGC TGAGTGGGAA 2 50 

GMGAAAATCA TCGAACTGGC TGGCTWCCTG GATTCTTACA TTCCGGAACC 300 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

5 CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 4 00 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 4 50 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTGCCGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

10 CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 700 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACGG 807 

15 



2 ) INFORMATION FOR SEQ ID NO: 165 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
30 serotype Typhi 

(B) STRAIN: ATCC 10749 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 

35 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

40 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTC CGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

45 ACGAAGGCCG TGCNGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 6 00 

GCCGCACACY AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 70 0 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

50 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 



55 2 ) INFORMATION FOR SEQ ID NO: 166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

100 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(D) 



TOPOLOGY : Linear 



10 



15 



20 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia fonticola 

(B) STRAIN: DSM 4576 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 



CGGCGCTATC 
GTGAGCACAT 
TTCATGAACA 
AGAAATGGAA 
ACCTGCCGGT 
GAGTGGGAAG 
CCCAGAACCA 
ACGTATTCTC 
CGCGGTATCG 
CACCGTTAAG 
ACGAAGGCCG 
CGTGAAGACA 
GCCGCACACT 
GTGGTCGTCA 
CGTACAACTG 
GGTAATGCCA 
TCGCTATGGA 



CTGGTTGTAG 
CCTGCTGGGT 
AATGCGACAT 
GTTCGTGAAC 
TGTTCGTGGT 
CTAAAATCAT 
GAGCGTGCTA 
CATCTCCGGT 
TTAAAGTTGG 
TCTACCTGTA 
TGCTGGTGAG 
TCGAACGTGG 
CAGTTCGATT 
TACTCCATTC 
ACGTGACCGG 
GGCGATAACG 
CCAAGGC 



CTGCGACTGA 
CGTCAGGTTG 
GGTTGATGAT 
TTCTGTCTGC 
TCAGCGCTGA 
CGAGCTGGCC 
TCGATCAGCC 
CGTGGTACCG 
CGAAGAAGTT 
CTGGCGTTGA 
AACGTTGGTG 
TCAGGTACTG 
CAGAAGTGTA 
TTCAAAGGCT 
TACCATCGAA 
TGAACATGGT 



CGGCCCTATG 
GCGTTCCTTA 
GAAGAGCTGC 
TTATGACTTC 
AAGCACTGGA 
GGTCACCTGG 
GTTCCTGCTG 
TAGTTACCGG 
GAAATCGTTG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAACCAG 
TATCCTGAGC 
ACCGTCCACA 
CTGCCAGAAG 
TGTTACCCTG 



CCTCAGACTC 50 

CATCATCGTA 100 

TGGAACTGGT 150 

CCTGGTGATG 200 

AGGCGAAGCT 250 

ATTCCTACAT 300 

CCAATCGAAG 350 

TCGTGTTGAG 400 

GTATCAAAGA 450 

AAACTGCTGG 500 

TGGTATCAAG 550 

GTTCCATCAA 600 

AAAGAAGAAG 650 

GTTCTACTTC 700 

GCGTAGAGAT 750 

ATCCACCCAA 8 00 

817 



30 



40 



2) INFORMATION FOR SEQ ID NO: 167 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 787 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 

GCTGCGACTG ACGGCCCAAT GCCTCAGACC CGTGAGCACA TCCTGCTGGG 50 

TCGTCAGGTT GGCGTTCCTT TCATCATCGT ATTCATGAAC AAATGCGACA 100 

TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAAATGGA AGTTCGTGAA 150 

50 CTTCTGTCTG CTTACGACTT CCCTGGTGAT GACCTGCCGG TTGTTCGTGG 2 00 

TTCAGCGCTG AAAGCACTGG AAGGCGAAGC TGAGTGGGAA GCTAAAATCA 250 

TCGAGCTGGC CGGTTACCTG GATTCTTACA TCCCAGAACC AGAGCGTGCT 3 00 

ATCGACAAGC CGTTCCTGCT GCCAATCGAA GACGTCTTCT CCATCTCCGG 3 50 

TCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGTATC GTTAAAGTTG 400 

55 GCGAAGAAGT TGAAATCGTT GGTATCAAAG ACACCGTTAA GTCTACCTGT 450 

ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC GTGCTGGTGA 500 

GAACGTTGGT GTTCTGCTGC GTGGTATCAA GCGTGAAGAC ATCGAACGTG 550 

GTCAGGTACT GGCTAAACCA GGTTCAATCA AGCCACACAC CAAGTTCGAC 600 

TCAGAAGTGT ACATCCTGAG CAAAGAAGAA GGTGGTCGTC ATACTCCATT 650 

60 CTTCAAAGGC TACCGTCCAC AGTTCTACTT CCGTACAACT GACGTGACCG 700 
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GTACCATCGA ACTGCCAGAA GGCGTTGAAA TGGTAATGCC AGGTGACAAC 750 
GTGAACATGG TTGTTACCCT GATCCACCCA ATCGCGA 787 



2) INFORMATION FOR SEQ ID NO: 168 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LiENGTH: 745 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE; Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13 880 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 

GCCTCAGACT CGTGAGCACA TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT 50 

TCATCATCGT ATTCATGAAC AAATGCGACA TGGTTGATGA TGAAGAGCTG 100 

YTGGAACTGG TAGAAATGGA AGTTCGCGAA CTGCTGTCCG CTTACGACTT 150 

CCCTGGCGAC GACCTGCCGG TAATCCGCGG TTCCGCGCTG AAAGCGCTGG 2 00 

AAGGCGAAGC TGAGTGGGAA GCGAAAATCA TCGAACTGGC CGAAGCCCTG 250 

GACAGCTACA TCCCAGAGCC AGAGCGTGCT ATCGACAAGC CGTTCCTGCT 300 

GCCAATCGAA GACGTATTCT CCATCTCCGG TCGTGGTACC GTTGTTAC CG 350 

GTCGTGTTGA GCGCGGCATC ATCAAAGTTG GCGAAGAAGT TGAAATCGTT 4 00 

GGTATCAAAG ACACCGTTAA GTCTACCTGT ACTGGCGTTG AAATGTTCCG 4 50 

CAAACTGCTG GACGAAGGCC GTGCTGGTGA GAACGTAGGT GTTCTGCTGC 500 

GTGGTATCAA ACGTGAAGAA ATCGAACGTG GTCAGGTACT GGCTAAGCCA 550 

GGCTCCATCA AGCCGCACAC CCAGTTCGAA TCTGAAGTGT ACATCCTGAG 600 

CAAAGATGAA GGTGGTCGTC ACACKCCATT CTTCAAAGGC TACCGTCCAC 650 

AGTT CTACTT CCGTACCACT GACGTGACCG GTACCATCGA ACTGCCAGAA 700 

GGCGTAGAGA TGGTAATGCC AGGCGACAAC GTGAACATGG TTGTA 745 



2) INFORMATION FOR SEQ ID NO : 169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odorifera 

(B) STRAIN: ATCC 33077 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 

GGCGCAATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCTTTC ATCATCGTGT 100 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAGCT GCTGTCTGCT TACGATTTCC CTGGCGACGA 200 

CCTGCCAGTA ATC CGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 
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AGTGGGAAGC TAAGATTGTA GAACTGGCTG 
CCAGAACCAG AGCGTGCTAT CGACAAGCCG 
CGTATTCTCC ATCTCCGGTC GTGGTACCGT 
GCGGTATCAT CAAAGTTGGC GAAGAAGTTG 
5 ACCGTTAAGT CTACCTGTAC CGGTGTAGAA 
CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT 
GTGAAGACAT CGAACGTGGT CAGGTTCTGG 
CCGCACACCA AATTCGACTC AGAAGTGTAC 
TGGTCGTCAC ACGCCATTCT TCAAAGGCTA 
10 GTACTACTGA CGTGACCGGT ACCATCGAAC 
GTAATGCCAG GCGATAACGT GAACATGGTT 
CGCAATGGAC GACGGTCTGC GTTTCGCAA 



AAGCGCTGGA TTCTTACATC 3 00 

TTCCTGCTGC CAATCGAAGA 350 

TGTTACCGGT CGTGTTGAGC 400 

AAATCGTTGG TATCAAAGAC 450 

ATGTTCCGCA AACTGCTGGA 500 

TCTGCTGCGT GGTAT CAAGC 550 

CTAAACCAGG TTCTATCAAG 600 

ATCCTGAGCA AAGAAGAAGG 650 

CCGTCCACAG TTCTACTTCC 7 00 

TGCCAGAAGG CGTAGAGATG 750 

GTTACCCTGA TTCACCCAAT 8 00 

829 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 17 0 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 83 0 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: DSM 4540 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 

CGGCGCAATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTT CATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

35 AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTACGACTTC CCTGGTGATG 200 

ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAACCA 250 

GAGTGGGAAG CTAAAATCAT CGAGCTGGCT GGTTTCCTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCTA TCGACAAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTTGAG 400 

£0 CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

CACCGTTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTGGGTG TTCTGCTGCG TGGTATCAAG 550 

CGCGAAGATA TCGAACGTGG TCAGGTCCTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACC AAGTTTGACT CAGAAGTGTA CATCCTGAGC AAAGAAGAAG 650 

45 GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGAC CGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGTGACAACG TGAACATGGT TGTAACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGCCTG CGTTTCGCAA 830 



2 ) INFORMATION FOR SEQ ID NO: 171 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

SO (ii) MOLECULE TYPE: Genomic DNA 

103 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATGC 27593 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 

GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTACCTTAC ATCATCGTAT 100 

10 TCATGAACAA ATGCGACATG GTAGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGCGAACT GCTGTCTGCT TACGACTTCC CAGGCGACGA 200 

CCTGCCGGTA ATCCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC GAAAATCGTT GAGCTGGCAG AAGCGCTGGA CAGCTACATC 300 

CCAGAGCCAG AGCGTGCTGT AGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

15 CGTATTCTCC" ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTAGG TATCAAAGAC 450 

ACCGTTAAGT CTACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CGAAGCCAGG TTCAATCAAG 600 

2 0 CCGCACACCC AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGACAACGT GAACATGAAA GTTACTCTGA TTCACCCAAT 8 00 

CGCAATGGAC GACGGTCTGC GTTTCGCAA 82 9 

25 



2) INFORMATION FOR SEQ ID NO: 172 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 
40 (B) STRAIN: ATCC 92 07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

CGGCGCGATC YTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

45 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

50 TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGT AC CG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

55 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGC CGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

60 TCGCGATGGA CGACGGTCTG CGTTTC 826 
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2) INFORMATION FOR SEQ ID NO: 173 

5 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 

2 0 TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 50 
CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA , 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 2 00 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 2 50 

25 GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCYTAYATT CCGGAACCAG 3 00 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 3 50 

ATCTCCGGTC GTGGTACCGT TGTTAC CGGT CGTGTAGAAC GCGGTATCAT 4 00 

CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACYCAGAAGT 450 

CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

30 GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CGAAGCCRGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

35 GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCGATGGAC 800 

GACGGTCTGC GTTTCGCA 818 



40 2) INFORMATION FOR SEQ ID NO: 174 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 06 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella, flexneri 

(B) STRAIN: ATCC 12022 



55 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 174 



CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 10 0 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACTCCGA 200 

60 TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 2 50 
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GCGAAAATCC TGGAACTGGC TGGCTTCCTG 
AGAGCGTGCG ATTGACAAGC CGTTCCTGCT 
CCATCTCCGG TCGTGGTACC GTTGTTACCG 
ATCAAAGTTG GTGAAGAAGT TGAAATCGTT 
5 GTCTACCTGT ACTGGCGTTG AAATGTTCCG 
GTGCTGGTGA GAACGTAGGT GTTCTGCTGC 
ATCGAACGTG GTCAGGTACT GGCTAAGCCG 
CAAGTTCGAA TCTGAAGTGT ACATTCTGTC 
ATACTCCGTT CTTCAAAGGC TACCGTCCGC 
10 GACGTGACTG GTACCATCGA ACTGCCGGAA 
GGGCGACAAC ATCAAAATGG TTGTTACCCT 
ACGACG 



GATTCTTACA TTCCGGAACC 3 00 

GCCGATCGAA GACGTATTCT 350 

GTCGTGTAGA ACGCGGTATC 4 00 

GGTATCAAAG AGACTCAGAA 450 

CAAACTGCTG GACGAAGGCC 500 

GTGGTATCAA ACGTGAAGAA 550 

GGCACCATCA AGCCGCACAC 600 

CAAAGATGAA GGCGGMCGTC 650 

AGTTCTACTT CCGTACTACT 700 

GGCGTAGAGA TGGTAATGCC 750 

GATCCACCCG ATCGCGATGG 800 

806 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO : 175 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella soxmei 

(B) STRAIN: ATCC 2 993 0 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 

CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

35 TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

40 CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTT CGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

45 GCGGYCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



2) INFORMATION FOR SEQ ID NO: 17 6 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 13301 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 

TCTGCTGCTG ACGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC 50 

ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC AAAGTTGACA 100 

10 TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC 150 

TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG TAATCGCTGG 200 

TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA GAAAAAATCT 250 

TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC AGAACGTGAT 300 

TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG 350 

15 TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG 400 

GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA AACAACTGTT 450 

ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG AAGCTGGTGA 500 

CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG 550 

GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC TGAATTCAAA 600 

20 GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC ACACTCCATT 650 

CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT GACGTAACTG 700 

GTGTTGTTCA CTTACC 716 



25 



35 



2) INFORMATION FOR SEQ ID NO: 177 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 719 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 29247 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 

TTCTTTTATC ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC 50 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

45 TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 200 

GAAAAAATCT TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC 250 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 300 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA 400 

50 AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 4 50 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 500 

GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC 550 

TGAATTCAAA GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC 600 

ACACTCCATT CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT 650 

55 GACGTAACTG GTGTTGTTCA CTTACCAGAA GGTMCTGAAA TGGTAATGCC 700 

TGGTGATAAC GTTGAAATG 719 



60 2) INFORMATION FOR SEQ ID NO: 178 
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( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 33591 



15 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 



GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 200 

2 0 CAATACGAAG AAAAAATCTT AGAATTAATG GAAG CTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAA.CC ATTCATGATG CCAGTTGAGG 3 00 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 450 

2 5 ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATT CAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTC 62 5 



2) INFORMATION FOR SEQ ID NO: 179 



(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 43300 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 



GTTGGTGTAC CAGCATTAGT AGTATTCTTA AACAAAGTTG ACATGGTTGA 50 

CGATGAAGAA TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAA 100 

50 GCGAATATGA CTTCCCAGGT GACGATGTAC CTGTAATCGC TGGTTCAGCA 150 

TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA TCTTAGAATT 2 00 

AATGGAAGCT GTAGATACTT ACATTCCAAC TCCAGAACGT GATTCTGACA 250 

AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC TGGTCGTGGT 30 0 

ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG TTGGTGAAGA 350 

55 AGTTGAAATC ATCGGTTTAC ATGACACATC TAAAACAACT GTTACAGGTG 4 00 

TTGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG TGACAACATT 450 

GGTGCATTAT TACGTGGTGT TGCTCGTGAA GACGTACAAC GTGGT CAAGT 500 

ATTAGCTGCT CCTGGTTCAA TTACACCACA TACTGAATTC AAAGCAGAAG 550 

TATACGTATT ATCAAAAGAC GAAGGTGGAC GTCACACTCC ATTCTTCTCA 600 

60 AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA CTGGTGTTGT 650 
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TCACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGTGAT AACGTTGAAA 700 
TGAC 704 



2) INFORMATION FOR SEQ ID NO: 18 0 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 73 0 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 653 8 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

25 ATGTAC CTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 2 00 

CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 00 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

30 CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTT CCGT AAATTATTAG 4 50 

ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAAC GTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTCAAACT ATCGTCCACA ATTCTATTTC 650 

35 CGTACTACTG ACGTAACTGG TGTTGTTCAC TTACCAGAAG GTACTGAAAT 700 

GGTAATGCCT GGTGATAACG TTGAAATGAC 73 0 



40 2) INFORMATION FOR SEQ ID NO : 181 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 4 bases 

(B) TYPE: Nucleic acid 
45' (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN: ATCC 33753 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 



CGGTGCGATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGAC C A AGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATACGACTAC CCAGGTGACG 200 

60 ATGTACCTGT AATCTCTGGT TCTGCGTTGA AAGCATTAGA AGGCGACAAA 250 
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GAATACGAAC AAAAAATCTT 
TCCAACTCCA GAACGTGACT 
ACGTATTCTC AATCACTGGT 
CGTGGTCAAA TCAAAGTCGG 
5 CGGTTCACAA AAAACAACAG 
TAGACTACGC TGAAGCTGGT 
TCACGTGAAG AAGTACAACG 
TACACCACAC ACTAAATTCA 
AAGGTGGACG TCACACTCCA 
10 TTCCGTACTA CTGACGTAAC 
AATGGTAATG CCTGGCGATA 
CAAT CGCTAT CGAAGACGGT 



AGACTTAATG CAACAAGTTG 
CTGATAAACC ATTCATGATG 
CGTGGTACTG TTGCAACAGG 
TGAAGAAGTT GAAATCATCG 
TTACTGGTGT AGAAATGTTC 
GACAACATCG GTGCTTTATT 
TGGTCAAGTT TTAGCTGCTC 
CTGCAGAAGT TTACGTATTA 
TTCTTCTCTA ACTACCGTCC 
AGGTGTTGTT ACTTTACCAG 
ACGTTAAAAT GGAAGTTGAA 
ACTCGTTTCT CAAT 



ACGATTACAT 


300 


CCAGTTGAAG 


350 


CCGTGTTGAA 


400 


GTATGAAAGA 


450 


CGTAAATTAT 


500 


ACGTGGTATT 


550 


CTGGTTCAAT 


600 


TCTAAAGATG 


650 


ACAATTCTAT 


700 


AAGGTACAGA 


750 


TTAATTTCTC 


800 




834 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 182 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 835 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 

CGGCGGTATC TTAGTAGTAT CTGCTG CTGA CGGTCCAATG CCACAAACTC 5 0 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 10 0 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

35 TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGATG 20 0 

ATGT AC CTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 250 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTACAT 30 0 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

40 CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATCCACGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 50 0 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 55 0 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATCAC 600 

ACCACACACT AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

45 GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCTA 800 

TCGCTATTGA AGACGGTACT CGTTTCTCAA TCGGA 83 5 



2 ) INFORMATION FOR . SEQ ID NO: 183 



(i) SEQUENCE CHARACTER I STI CS : 
55 (A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

SO (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Macrococcus caseolyticus 

(B) STRAIN: ATCC 13548 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 

GTATCTTAGT AGTATCTGCT GCTGACGGTC CAATGCCACA AACTCGTGAA 50 

CACATCCTTT TATCACGTAA CGTTGGTGTA CCAGCATTAG TAGTATTCTT 100 

10 GAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTTGAAA 150 

TGGAAGTTCG TGACTTATTA TCTGAATATG ACTTCCCTGG TGACGATGTA 2 00 

CCTGTAATCG CTGGATCTGC TTTAAAAGCA TTAGAAGGCG TTGAAGAATA 250 

CGAAGACAAA ATCATGGAAT TAATGGACGC AGTTGATGAG TACATCCCAA 3 00 

CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 3 50 

15 TTCTCAATCA CTGGTCGTGG TACAGTTGCA ACTGGACGTG TTGAGCGTGG 4 00 

ACAAGTTAAA GTTGGTGAAG AAGTTGAAAT CATTGGTTTA ACTGAAGAAC 450 

CAGCAAAAAC TACAGTTACA GGTGTAGAAA TGTTCCGTAA ATTATTAGAT 500 

TACGCTGAAG CTGGAGATAA CATCGGTGCT TTATTACGTG GTGTTTCTCG 550 

TGAAGACGTA CAACGTGGAC AAGTATTAGC TAAACCAGGT TCAATTACTC 60 0 

20 CACATACTAA ATTCAAAGCT GAAGTTTACG TATTATCTAA AGAAGAAGGT 650 

GGACGTCATA CTCCATTCTT CACTAACTAC CGCCCTCAGT TCTACTTCCG 7 00 

TACAACTGAC GTAACTGGTG TAGTTAACTT ACCAGAAGGT ACTGAAATGG 750 

TAATGCCTGG AGATAACATC GAAATGAACG TTGAATTAAT TTCTCCAATC 80 0 

GCGA 804 

25 



2) INFORMATION FOR SEQ ID NO: 184 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
£0 (B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

£5 GTGAACATAT CCTTTTATCA CGTAACGTTG GTGTTCCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 200 

ATGTACCTGT AATCTCTGGT TCAGCATTAA AAGCTCTTGA AGGCGACGCT 250 

GACTATGAGC AAAAAATCTT AGACTTAATG CAAGCTGTTG ATGACTTCAT 300 

50 TCCAACACCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG GCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATGCAAGA 450 

AGATTCAAGC AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATTG GTGCGTTATT ACGTGGTGTT 550 

55 GCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTGCTC CTGGTTCAAT 600 

TACACCACAC ACAAACTTTA AAGCGGAAGT TTACGTTTTA TCAAAAGATG 650 

AAGGTGGCCG TCATACGCCA TTCTTCAGTA ACTATCGCCC ACAATTCTAT 7 00 

TTCCGTACTA CTGACGTAAC AGGTGTTGTT ACTTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAGAAAT GGAAGTTGAA CTAATTTCTC 800 

50 CAATCGCTAT CGAAGACGGT ACACGTTTCT CT 832 
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2) INFORMATION FOR SEQ ID NO: 185 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 699 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Staphylococcus Gpidermidis 

(B) STRAIN: CSG 269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 

20 ATCTGCTGCT GACGGTCCAA TGCCACAAAC TCGTGAACAC ATCTTATTAT 50 

CACGTAACGT TGGTGTACCA GCATTAGTTG TATTCTTAAA CAAAGTTGAC 100 

ATGGTAGACG ACGAAGAATT ATTAGAATTA GTTGAAATGG AAGTTCGTGA 150 

CTTATTAAGC GAATATGACT TCCCAGGTGA CGATGTACCT GTAATCGCTG 2 00 

GTTCTGCATT AAAAGCATTA GAAGGCGATG CTGAATACGA ACAAAAAATC 250 

25 TTAGACTTAA TGCAAGCAGT TGATGATTAC ATTCCAACTC CAGAACGTGA 300 

TTCTGACAAA CCATTCATGA TGCCAGTTGA GGACGTATTC TCAATCACTG 350 

GTCGTGGTAC TGTTGCTACA GGCCGTGTTG AACGTGGTCA AATCAAAGTT 400 

GGTGAAGAAG TTGAAATCAT CGGTATGCAC GAAACTTCTA AAACAACTGT 4 50 

TACTGGTGTA GAAATGTTCC GTAAATTATT AGACTACGCT GAAGCTGGTG 500 

30 ACAACATCGG TGCTTTATTA CGTGGTGTTG CACGTGAAGA CGTACAACGT 550 

GGTCAAGTAT TAGCTGCTCC TGGTTCTATT ACACCACACA CAAAATTCAA 600 

AGCTGAAGTA TACGTATTAT CTAAAGATGA AGGTGGACGT CACACTCCAT 650 

TCTTCACTAA CTATCGCCCA CAATTCTATT TCCGTACTAC TGACGTAAC 699 



35 



50 



2 ) INFORMATION FOR SEQ ID NO: 186 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100 

55 TTCTTAAATA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTACGTGACT TATTATCTGA ATACGACTTC CCAGGTGACG 2 00 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 250 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

60 ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 
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CGTGGGCAAA TCAAAGTTGG 
CACTTCTAAA ACAACTGTTA 
ACTACGCTGA AGCTGGTGAC 
CGTGAAGACG TACAACGTGG 
5 ACCTCACACA AAATTTAAAG 
GTGGACGTCA CACTCCATTC 
CGTACTACTG ACGTAACTGG 
GGTTATGCCT GGCGACAACG 
TCGCGATTGA AGACGGTACT 



TGAAGAAGTT GAAATCATTG 
CTGGTGTAGA AATGTTCCGT 
AACATCGGTG CATTATTACG 
TCAAGTATTA GCTGCTCCAG 
CAGACGTATA CGTTTTATCT 
TTCACAAACT ATCGTCCACA 
TGTTGTTAAC TTACCAGAAG 
TTGAAATGAC AGTAGAATTA 
CGTTTCTCA 



AAATTATTAG 500 

TGGTGTTGCT 550 

GTT CAATCAC 600 

AAAGACGAAG 650 

ATTCTATTTC 7 00 

GTACTGAAAT 750 

ATCGCTCCTA 800 
829 



2) INFORMATION FOR SEQ ID NO: 187 

15 { i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Staphylococcus warneri 
25 (B) STRAIN: CSG 123 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

CACAAACTCG TGAACACATT CTTTTATCAC GTAACGTTGG TGTACCAGCT 50 

30 TTAGTTGTAT TCTTAAACAA AGTTGATATG GTAGACGACG AAGAATTATT 100 

AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCTGAA TATGACTTCC 150 

CAGGTGACGA CGTACCTGTA ATCGCTGGTT CAGCATTAAA AGCTTTAGAA 200 

GGCGACGAAA AATACGAAGA AAAAATCTTA GAATTAATGC AAGCAGTTGA 250 

TGACTACATT CCAACTCCAG AACGTGATTC TGACAAACCA TTCATGATGC 30 0 

35 CAGTTGAGGA CGTATTCTCA ATCACTGGTC GTGGTACTGT TGCTACAGGC 350 

CGTGTTGAAC GTGGTCAAAT CAAAGTTGGT GAAGAAGTTG AAATCATCGG 40 0 

TTTACATGAC ACTTCTAAAA CAACTGTTAC TGGTGTAGAA ATGTTCCGTA 450 

AGTTATTAGA CTACGCTGAA GCTGGTGACA ACATCGGTGC TTTATTACGT 500 

GGTGTTGCTC GTGAAGACGT ACAACGTGGT CAAGTATTAG CTGCTCCTGG 550 

40 TTCAATTACA CCACATACAA AATTCAAAGC GGAAGTTTAC GTTTTATCTA 600 

AAGACGAAGG TGGACGTCAC ACTCCATTCT TCAGTAACTA CCGCCCACAA 650 

TTCTATTTCC GTACTACTGA CGTAACTGGC GTTGTT CAAT TACCAGAAGG 700 

TACTG 705 



2) INFORMATION FOR SEQ ID NO: 188 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 678 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus ha.emolytlcus 

(B) STRAIN: CSG 23 

60 
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(xi) SEQUENCE DESCRI1 

TTTTATCACG TAACGTTGGT 
GTTGACATGG TTGACGATGA 
5 ACGTGACTTA TTATCTGAAT 
TCGCTGGTTC AGCTTTAAAA 
AAAATCTTAG AATTAATGCA 
ACGTGACTCT GATAAACCAT 
TCACTGGTCG TGGTACTGTT 

10 AAAGTTGGTG AAGAAGTTGA 
AACTGTTACT GGTGTAGAAA 
CTGGTGACAA CATCGGTGCT 
CAACGTGGTC AAGTATTAGC 
ATTCAAAGCA GACGTATACG 

15 CTCCATTCTT CACTAACTAT 
GTAACTGGTG TTGTTAACTT 



TION: SEQ ID NO: 188; 

GTACCAGCAT TAGTAGTATT 
AGAATTATTA GAATTAGTTG 
ACGACTTCCC AGGTGACGAC 
GCTTTAGAAG GCGATGCTCA 
AGCAGTTGAT GATTACATTC 
TCATGATGCC AGTTGAGGAC 
GCTACAGGTC GTGTTGAACG 
AATTATTGGT ATCAAAGAAA 
TGTTCCGTAA ATTATTAGAC 
TTATTACGTG GTGTTGCTCG 
TGCTCCAGGT TCAATTACAC 
TTTTATCAAA AGATGAAGGT 
CGTCCACAAT TCTATTTCCG 
ACCAGAAG 



CTTAAACAAA 50 

AAATGGAAGT 100 

GTACCTGTAA 150 

ATACGAAGAA 200 

CAACTCCAGA 250 

GTATTCTCAA 300 

TGGTCAAATC 350 

CTTCTAAAAC 400 

TACGCTGAAG 450 

TGAAGATGTA 500 

CTCACACAAA 550 

GGACGTCATA 60 0 

TACTACTGAC 65 0 
678 



20 2) INFORMATION FOR SEQ ID NO: 189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 33 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 



ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

10 CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 4 00 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 4 50 

45 ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGG 668 

50 



2) INFORMATION FOR SEQ ID NO: 190 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic. acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
5 (B) STRAIN: CSG 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TTGAAATGGA 50 

10 AGTACGTGAC TTATTATCTG AATACGACTT CCCAGGTGAC GATGTACCTG 100 

TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 150 

GAAAAAATCT TAGAATTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 2 00 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 250 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 3 00 

15 ATCAAAGTTG GTGAAGAAGT TGAAATCATT GGTATCCATG ACACTTCTAA 350 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 4 00 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 450 

GTACAACGTG GTCAAGTATT AGCTGCTCCA GGTTCAATCA CACCTCACAC 500 

AAAATTTAAA GCAGACGTAT ACGTTTTATC TAAAGACGAA GGTGGACGTC 550 

2 0 ACACTCCATT CTTCACAAAC TATCGTCCAC AATTCTATTT CCG 593 



2) INFORMATION FOR SEQ ID NO: 191 



25 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Staphylococcus hominis subsp. hominis 

(B) STRAIN: ATCC 27844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 

40 CGGCGCTATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTACGTGACT TATTATCTGA ATACGACTTC CCAGGTGACG 2 00 

ACGTACCTGT AATCGCTGGT TCAGCTTTAA AAGCTTTAGA AGGCGATGCT 2 50 

45 CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTATAT 3 00 

TCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATTATTG GTATCAAAGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

50 ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGATG TACAACGTGG TCAAGTATTA GCTGCTCCAG GTTCAATTAC 600 

ACCTCACACA AAATTCAAAG CAGACGTATA CGTTTTATCA AAAGATGAAG 650 

GTGGACGTCA TACTCCATTC TTCTCTAACT ATCGTCCACA ATTCTATTTC 7 00 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

55 GGTAATGCCT GGTGACAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTC 828 



60 2) INFORMATION FOR SEQ ID NO: 192 
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SUBSTITUTE SHEET (RULE 26) 
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PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 620 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 35982 



15 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 



ATGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC ACGTAACGTT 50 

GGTGTACCAG CTTTAGTTGT ATTCTTAAAC AAAGTTGATA TGGTAGACGA 100 

CGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC TTATTATCTG 150 

AATATGACTT CCCAGGTGAC GACGTACCTG TAATCGCTGG TTCAGCATTA 2 00 

20 AAAGC TTTAG AAGGCGACGA AAAATACGAA GAAAAAATCT TAGAATTAAT 25 0 

GCAAGCAGTT GATGACTACA TTCCAACTCC AGAACGTGAT TCTGACAAAC 3 00 

CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG TCGTGGTACT 350 

GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG GTGAAGAAGT 400 

TGAAATCATC GGTTTACATG ACACTTCTAA AACAACTGTT ACTGGTGTAG 450 

2 5 AAATGTTCCG TAAGTTATTA GACTACGCTG AAGCTGGTGA CAACATCGGT 500 

GCTTTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG GTCAAGTATT 550 

AGCTGCTCCT GGTTCAATTA CACCACATAC AAAATTCAAA GCGGAAGTTT 600 

ACGTTTTATC TAAAGACGAA 62 0 



2) INFORMATION FOR SEQ ID NO: 193 



(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 



CCAGCATTAG TAGTATTCTT AAACAAAGTT GACATGGTTG ACGATGAAGA 50 

ATTATTAGAA TTAGTTGAAA TGGAAGTACG TGACTTATTA TCTGAATACG 100 

50 ACTTCCCAGG TGACGACGTA CCTGTAATCG CTGGTTCAGC TTTAAAAGCT 150 

TTAGAAGGCG ATGCTCAATA CGAAGAAAAA ATCTTAGAAT TAATGCAAGC 200 

AGTTGATGAT TATATTCCAA CTCCAGAACG TGACTCTGAT AAAC CATTC A 250 

TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT 300 

ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT 350 

55 TATTGGTATC AAAGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA 450 

TTACGTGGTG TTGCTCGTGA AGATGTACAA CGTGGTCAAG TATTAGCTGC 500 

TCCAGGTTCA ATTACACCTC ACACAAAATT CAAAGCAGAC GTATACGTTT 550 

TATCAAAAGA TGAAGGTGGA CGTCATACTC CATTCTTCTC TAACTATCGT 600 

60 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 
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AGAAGGTACT GAAATGGTAA TGCCTGGTGA CAACGTTGAA AT : 



5 2 ) INFORMATION FOR SEQ ID NO: 194 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 684 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 36 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 



CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 50 

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTTGAAATG 100 

GAAGTACGTG ACTTATTATC TGAATACGAC TTCCCAGGTG ACGACGTACC 150 

TGTAATCGCT GGTTCAGCTT TAAAAGCTTT AGAAGGCGAT GCTCAATACG 200 

25 AAGAAAAAAT CTTAGAATTA ATGCAAGCAG TTGATGATTA TATTCCAACT 250 

CCAGAACGTG ACTCTGATAA ACCATTCATG ATGCCAGTTG AGGACGTATT 300 

CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 350 

AAATCAAAGT TGGTGAAGAA GTTGAAATTA TTGGTATCAA AGAAACTTCT 400 

AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT TAGACTACGC 450 

30 TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT GCTCGTGAAG 500 

ATGTACAACG TGGTCAAGTA TTAGCTGCTC CAGGTTCAAT TACACCTCAC 550 

ACAAAATTCA AAGCAGACGT ATACGTTTTA TCAAAAGATG AAGGTGGACG 600 

TCATACTCCA TTCTTCTCTA ACTATCGTCC ACAATTCTAT TTCCGTACTA 650 

CTGACGTAAC TGGTGTTGTT AACTTACCAG AAGG 684 

35 



2) INFORMATION FOR SEQ ID NO: 195 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 
50 (B) STRAIN: CSG 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195 

ACCAGCATTA GTAGTATTCT 
55 AATTATTAGA ATTAGTTGAA 

GACTTCCCAG GTGACGACGT 

TTTAGAAGGC GATGCTCAAT 

CAGTTGATGA TTACATTCCA 

ATGATGCCAG TTGAGGACGT 
SO TACAGGCCGT GTTGAACGTG 



TAAACAAAGT TGACATGGTT 
ATGGAAGTAC GTGACTTATT 
ACCTGTAATC GCTGGTTCAG 
ACGAAGAAAA AATCTTAGAA 
ACTCCAGAAC GTGACTCTGA 
ATTCTCAATC ACTGGTCGTG 
GTCAAATCAA AGTTGGTGAA 
117 



GACGATGAAG 50 

ATCTGAATAC 100 

CTTTAAAAGC 150 

TTAATGCAAG 200 

TAAACCATTC 250 

GTACTGTTGC 300 

GAAGTTGAAA 350 
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TTATTGGTAT CAAAGAAACT TCTAAAACAA 
TTCCGTAAAT TATTAGACTA CGCTGAAGCT 
ATTACGTGGT GTTGCTCGTG AAGATGTACA 
CTCCAGGTTC AATTACACCT CACACAAAAT 
5 TTATCAAAAG ATGAAGGTGG ACGTCATACT 
TCCACAATTC TATTTCCGTA CTACTGACGT 
CAGAAGGTAC TGAAATGGTA ATGCCTGGCG 



CTGTTACTGG TGTAGAAATG 4 00 

GGTGACAACA TCGGTGCTTT 450 

ACGTGGTCAA GTATTAGCTG 500 

TCAAAGCAGA CGTATACGTT 550 

CCATTCTTCA CTAACTATCG 600 

AACTGGTGTT GTTAACTTAC 650 

ACAAC 685 



LO 

2) INFORMATION FOR SEQ ID NO: 196 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 611 bases 
L5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

20 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 62 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

GACTTATTAT CTGAATACGA CTTCCCAGGT GACGACGTAC CTGTAATCGC 50 

TGGTTCAGCT TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA 100 

TCTTAGAATT AATGCAAGCA GTTGATGATT ACATTCCAAC TCCAGAACGT 150 

30 GACTCTGATA AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC 200 

TGGTCGTGGT ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG 250 

TTGGTGAAGA AGTTGAAATT ATTGGTATCA AAGATACTTC TAAAACAACT 300 

GTTACTGGTG TAGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG 350 

TGACAACATC GGTGCTTTAT TACGTGGTGT TGCTCGTGAA GATGTACAAC 400 

35 GTGGTCAAGT ATTAGCTGCT CCAGGTTCAA TCACACCTCA CACAAAATTC 450 

AAAGCAGACG TATATGTTTT AT CAAAAG AT GAAGGTGGAC GTCATACTCC 500 

ATTCTTCACT AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA 550 

CTGGTGTTGT TAACTTACCA GAAGGT AC TG AAATGGTAAT GCCTGGCGAC 600 

AACGTTGAAA T 611 

10 



2) INFORMATION FOR SEQ ID NO: 197 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 
55 (B) STRAIN: ATCC 43809' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 

CGGCGGTATC TTAGTAGTTT CTGCTGCAGA TGGTCCAATG CCACAAACTC 50 
50 GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTGCCAGC ATTAGTAGTA 100 

118 
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TTCTTAAACA AAGTTGACAT 
AGAAATGGAA GTTCGTGATT 
ATGTGCCTGT AATCGCTGGT 
AAATACGAAG CTAAAATCTT 
5 TCCAACTCCA GAACGTGACT 
ACGTATTCTC AATCACTGGT 
CGTGGTCAAA TCAAAGTCGG 
TACTACTAAA ACAACTGTTA 
ACTACGCTGA AGCTGGTGAC 

10 CGTGAAGATG TACAACGTGG 
ACCTCACACT AAATTTAAAG 
GTGGACGTCA TACACCATTC 
CGTACTACAG ACGTAACTGG 
GGTTATGCCT GGCGACAACG 

15 TCGCTATCGA AGACGGAACT 



GGTTGACGAT GAAGAATTAT 
TATTAACTGA ATATGACTTC 
TCAGCATTAA AAGCTTTAGA 
AGAATTAATG GATGCAGTTG 
CTGACAAACC ATTCATGATG 
CGTGGTACTG TTGCTACAGG 
TGAAGAAGTT GAAATTATTG 
CTGGTGTAGA AATGTTCCGT 
AACATCGGTG CGTTATTACG 
ACAAGTATTA GCTGCTCCAG 
CTGACGTATA TGTTTTATCT 
TTCTCAAACT ACCGCCCACA 
TGTTGTTAAC TTACCAGAAG 
TTGAAATGAC AGTTGAATTA 
CGTTTCTC 



TAGAATTAGT 150 

CCAGGTGACG 2 00 

AGGCGACGAA 250 

ATAACTACAT 3 00 

CCAGTTGAGG 350 

CCGTGTTGAA 4 00 

GTATCCACGA 450 

AAATTATTAG 50 0 

TGGTGTTGCT 550 

GTTGAATTAC 600 

AAAGATGAAG 650 

ATTCTATTTC 700 

GTACAGAAAT 750 

ATCGCTCCAA 800 

828 



2) INFORMATION FOR SEQ ID NO : 198 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Staphylococcus saprophytdcus 

(B) STRAIN : ATCC 35552 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 

35 AGTAGTATCT GCTGCTGATG GCCCAATGCC ACAAACTCGT GAACACATTC 50 

TTTTAT CACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT CTTAAACAAA 100 

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTGACTTA TTAAGCGAAT ATGACTTCCC AGGTGACGAT GTACC TGTAA 2 00 

TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA CTATGAGCAA 250 

40 AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC CAACACCAGA 3 00 

ACGTGATTCT GACAAACCAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 3 50 

TCACTGGTCG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTCAAATC 4 00 

AAAGTCGGTG AAGAAATCGA AATCATCGGT ATGCAAGAAG AATCAAGCAA 450 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 500 

45 AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTTC ACGTGATGAC 550 

GTACAACGTG GTCAAGTTTT AGCTGCTCCT GGTACTATTA CACCACATAC 600 

AAAATTCAAA GCGGATGTTT ACGTTTTATC TAAAGATGAA GGTGGTCGTC 650 

ATACACCATT CTTCACTAAC TACCGCCCAC AATTCTATTT 690 



50 



2 ) INFORMATION FOR SEQ ID NO: 199 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 723 bases 

(B) TYPE: Nucleic acid 
(C> STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

119 
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(vi) ORIGINAL SOURCE : 

{A) ORGANISM: Staphylococcus saprophyticus 
(B) STRAIN: CSG 83 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 

GCATTAGTTG TATTCTTAAA CAAAGTTGAC ATGGTTGACG ATGAAGAATT 50 

ATTAGAATTA GTAGAAATGG AAGTTCGTGA TTTATTAAGC GAATATGACT 100 

10 TCCCAGGTGA CGATGTACCT GTAATCTCTG GTTCTGCATT AAAAGCTTTA 150 

GAAGGCGACG CTGACTATGA GCAAAAAATC TTAGACTTAA TGCAAGCTGT 200 

TGATGACTTC ATTCCAACAC CAGAACGTGA TTCTGACAAA CCATTCATGA 250 

TGCCAGTTGA GGACGTATTC TCAATCACTG GTCGTGGTAC TGTTGCTACA 300 

GGCCGTGTTG AACGTGGTCA AATCAAAGTC GGTGAAGAAA TCGAAATCAT 350 

15 CGGTATGCAA GAAGAATCAA GCAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT TGGTGCATTA 450 

TTACGTGGTG TTTCACGTGA TGACGTACAA CGTGGTCAAG TTTTAGCTGC 500 

TCCTGGTACT ATTACACCAC ATACAAAATT CAAAGCGGAT GTTTACGTTT 550 

TATCTAAAGA TGAAGGTGGT CGTCATACAC CATTCTTCAC TAACTACCGC 600 

20 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 

AGAAGGTACT GAAATGGTTA TGCCTGGCGA TAACGTTGAA ATGGATGTTG 7 00 

AATTAATTTC TCCAATCGCT ATT 723 



25 



35 



2 ) INFORMATION FOR SEQ ID NO: 200 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 697 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) ' STRAIN: CSsa 18 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 

CGTTGGTGTT CCAGCATTAG TTGTATTCTT AAACAAAGTT GACATGGTTG 50 

ACGATGAAGA ATTATTAGAA TTAGTAGAAA TGGAAGTTCG TGACTTATTA 100 

AGCGAATATG ACTTCCCAGG TGACGATGTA CCTGTAATCT CTGGTTCTGC 150 

4 5 ATTAAAAGCT TTAGAAGGCG ACGCTGACTA TGAGCAAAAA ATCTTAGACT 2 00 

TAATGCAAGC TGTTGATGAC TTCATTCCAA CACCAGAACG TGATTCTGAC 250 

AAACCATTCA TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG 300 

TACTGTTGCT ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTCGGTGAAG 350 

AAATCGAAAT CATCGGTATG CAAGAAGAAT CAAGCAAAAC AACTGTTACT 400 

50 GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG CTGGTGACAA 450 

CATTGGTGCA TTATTACGTG GTGTTTCACG TGATGACGTA CAACGTGGTC 500 

AAGTTTTAGC TGCTCCTGGT ACTATTACAC CACATACAAA ATTCAAAGCG 550 

GATGTTTACG TTTTATCTAA AGATGAAGGT GGTCGTCATA CACCATTCTT 600 

CACTAACTAC CGCCCACAAT TCTATTTCCG TACTACTGAC GTAACTGGTG 650 

55 TTGTTAACTT ACCAGAAGGT ACTGAAATGG TTATGCCTGG CGATAAC 697 



2 ) INFORMATION FOR SEQ ID NO : 201 

60 

120 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Staphylococcus sciuri subsp. sciuri 

(B) STRAIN: ATCC 29060 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 201 

15 CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCTCAAACTC 50 

GTGAGCACAT TCTTTTATCA CGTAACGTAG GTGTTCCTGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGCGACG 200 

ACGTTCCTGT AATTGCTGGT TCAGCATTAA AAGCATTAGA AGGCGACGAA 250 

20 GCTTACGAAG ACAAAATCAT GGAATTAATG GATGCTGTTG ATACATTCAT 300 

CCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCACTGTTGG TGAAGAAGTT GAAATCATCG GTTTAACTGA 450 

AGAATCTTCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

25 TAGACTTCGC TGAAGCTGGA GATAACATCG GTGCATTATT ACGTGGTGTT 550 

GCTCGTGAAG ACGTTAACCG TGGTCAAGTA TTAGCTAAAC CAGGTTCAAT 600 

CACACCTCAC ACTAAATTCA AAGCTGAAGT TTATGTATTA TCTAAAGACG 650 

AAGGTGGACG TCATACTCCA TTCTTCACAA ACTACCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC TGGTGTAGTT AACTTACCAG AAGGTACTGA 750 

30 AATGGTTATG CCTGGCGACA ACGTTGAAAT GGACGTTGAA TTAATTTCAC 800 

CAATCGCTAT TGAAGACGGT ATCGTTTCTC AATCA 835 



35 2 ) INFORMATION FOR SEQ ID NO: 202 

( i ) SEQUENCE CHARACTERI ST I CS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 



CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200 

55 ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450 

60 CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 
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ACTACGCTGA AGCTGGTGAC 
CGTGAAGACG TACAACGTGG 
ACCACATACA AAATTCAAAG 
GTGGACGTCA CACTCCATTC 
5 CGTACTACTG ACGTAACTGG 
GGTTATGCCT GGTGATAACG 
TCGCGATTGA AGACGGTACT 



AACATCGGTG CTTTATTACG 
TCAAGTATTA GCTGCTCCTG 
CGGAAGTTTA CGTTTTATCT 
TTCAGTAACT ACCGCCCACA 
CGTTGTTCAA TTACCAGAAG 
TTGAAATGAC AGTAGAATTA 
CGTTTCTCAA C 



PCT/CA00/01150 

TGGTGTTGCT' 5% 6 

GTTCAATTAC 600 

AAAGACGAAG 650 

ATTCTATTTC 700 

GTACTGAAAT 750 

ATCGCTCCTA 8 00 

831 



10 



20 



45 



2 ) INFORMATION FOR SEQ ID NO: 203 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 829 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 50 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 

CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

30 AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200 

ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

35 CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450 

CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 600 

ACCACATACA AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

40 GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG CGTTGTTCAA TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGTGATAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTCA 829 



2) INFORMATION FOR SEQ ID NO: 204 



( i ) SEQUENCE CHARACTERI STI CS : 
50 (A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium longum 

(B) STRAIN: ATCC 157 07 

60 

122 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 04 

TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCCCAGACTC 50 

GCGAGCACGT GCTGCTCGCC CGTCAGGTTG GCGTTCCGAA GATCCTCGTC 100 

5 GCC CTGAACA AGTGCGACAT GGTCGACGAT GAAGAGCTCA TCGAGCTCGT 150 

CGAAGAAGAG GTCCGCGACC TCCTCGACGA GAACGGCTTC GACCGTGACT 200 

GCCCGGTCAT CCACACCTCC GCTTACGGTG CTCTGCACGA CGACGCTCCG 250 

GACCACGAGA AGTGGGTCCA GTCCGTTAAG GACCTCATGG ACGCTGTCGA 300 

CGACTACATC CCGACCCCGG TTCACGACCT GGACAAGCCG TTCCTGATGC 350 

10 CGATCGAGGA CGTCTTCACC ATCTCCGGCC GTGGTACCGT TGTCACCGGT 400 

CGTGTCGAGC GTGGCCAGCT GGCCGTCAAC ACCCCGGTCG AGATCGTTGG 45 0 

TATCCGTCCG ACCCAGCAGA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500 

AGACCATGGA CGCCTGCGAG GCTGGCGACA ACACCGGTCT GCTTCTGCGT 550 

GGTCTCGGCC GTGACGATGT CGAGCGTGGC CAGGTTGTGG CCAAGCCGGG 600 

15 CTCCGTCACC CCGCACACCA AGTT CGAGGG CGAAGTCTAC GTGCTGACCA 65 0 

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 70 0 

TTCTACTTCC GCACCACCGA CGTCACCGGC GTCATCGAGC TGCCGGAAGG 75 0 

CGTCGAGATG GTTCAGCCGG GCGACCACGC TACCTTCACC GTTGAGCTGA 800 

TTCAGCCCAT CGCTATGGAG GAAGGCCTGA CCTTCGCTG 83 9 

20 

2) INFORMATION FOR SEQ ID NO: 2 05 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 54 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 
35 (B) STRAIN: CDC F333 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205 

CGGCGCGATC CTGGTGTGCT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

£0 GTGAGCACAT CCTGCTGTCG CGCCAGGTCG GCGTGCCGTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC TCGAGCTGGT 150 

CGAGATGGAA GTGCGCGAAC TGCTGAGCAA GTACGAGTTC CCGGGCGACG 2 00 

ACACCCCGAT CATCGCCGGT TCGGCCCGCC TGGCGCTGGA AGGCG AC CAG 2 50 

AGCGACATCG GCGTGCCGGC CATCCTGAAG CTGGTCGACG CGCTGGACAG 300 

i5 CTGGATTCCG GAGCCGGAGC GTGCGATCGA CAAGCCGTTC CTGATGCCGG 350 

TGGAAGACGT GTTCTCGATC TCGGGCCGCG GCACCGTGGT GACCGGTCGT 4 00 

ATCGAGCGCG GCGTGATCAA GGTTGGCGAC GAAATCGAAA TCGTCGGCAT 450 

CCGTCCGGTG CAGAAGACCA CCGTGACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTGGACCA GGGTCAGGCA GGCGACAACG CTGGCCTGCT GCTGCGCGGC 550 

50 ACCAAGCGTG ATGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGCAC 600 

GATCAAGCCG CACACCAAGT TCGAAGGCGA AGTGTACGTC CTGTCGAAGG 650 

ACGAGGGCGG CCGCCACACC CCGTTCTTCA ACGGCTACCG TCCGCAGTTC 700 

TACTTCCGCA CCACCGACAT CACCGGCGCC GCTGCACTGC CGGAAGGCGT 750 

CGAA 754 

55 

2) INFORMATION FOR SEQ ID NO: 206 
50 (i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 

(B) STRAIN: ATCC 51726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAAAA CCTTATCGTT 100 

TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATACGATTTC CCAGGTGATG 200 

ATCTTCCAGT TGTTCAAGGT TCAGCTCTTA AAGCGCTTGA AGGTGATTCA 250 

GCACAAGAAG ATGTTATCAT GGAATTGATG TCAATCGTTG ACACATACAT 300 

TCCAGAACCA GAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAGG 350 

ATGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTAT CGAC 400 

CGTGGTACTG TTAAAGTTAA TGACGAAGTT GAAATCGTTG GTATCAAAGA 450 

CGAAATCTCT AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCTGGA GATAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTACGTTCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCGATA ACTACCGTCC TCAGTTCTAC 70 0 

TTCCGTACAA CTGACGTAAC TGGTTCAATC AAATTGCCAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGAAGTTGAG TTGATCCACC 800 

CAATCGCCGT TGAACAAGGT ACTACTTTCT CTATC 83 5 



2) INFORMATION FOR SEQ ID NO : 2 07 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactia** 

(B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 

CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 2 00 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ACGAAAAATA 250 

CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT TGAAGATGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 
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TGATGAAATC GAACGTGGTC AAGTTCTTGC 
CACACACTAA ATTTAAAGGT GAAGTTTACA 
GGACGTCATA CTCCATTCTT CAACAACTAC 
TACAACTGAC GTAACAGGTT CAATCGAACT 
TTATGCCTGG TGATAACGTT ACTATCGAAG 
GCCGTAGAAC AAGGTACTA 



TAAACCAGGT ^Z^Zl&ZX* 



too 

650 
700 
750 
800 
819 



TCCTTTCTAA AGAAGAAGGT 
CGTCCACAAT TCTACTTCCG 
TCCAGCAGGA ACAGAAATGG 
TTGAATTGAT TCACCCAATC 



LO 



L5 



20 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 2 08 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 

CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ATGAAAAATA 250 

CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AAC CAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT CGAAGATGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG • 40 0 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTCTTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 650 

GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACAACTGAC GTAACAGGTT CAATCGAACT TCCAGCAGGA ACAGAAATGG 750 

TTATGCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT TCACCCAATC 800 

GCCGTAGAAC AAGGTACTA 819 



2) INFORMATION FOR SEQ ID NO : 209 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 13813 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 
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AGCTATCCTT GTAGTTGCTT CAACTGATGG ACCAATGCCA CAAACTCGTG 50 

AGCACATCCT TCTTTCACGT CAAGTTGGTG TTAAACACCT TATCGTATTC 100 

ATGAACAAAG TTGACCTTGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

5 AATGGAAATT CGTGACCTTC TTTCAGAATA CGACTTCCCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CACTTGAAGG CGATGAAAAA ,250 

TACGAAGACA TCATCATGGA ATTGATGAGC ACTGTTGATG AGTACATTCC 3 00 

AGAACCAGAA CGTGATACTG ACAAACCTTT ACTTCTTCCA GTCGAAGATG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 4 00 

LO GGTACTGTTC GTGTCAACGA CGAAGTTGAA ATCGTTGGTA TTAAAGAAGA 450 

TATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAC AACGTTGGTG TTCTTCTTCG TGGTGTTCAA 550 

CGTGATGAAA TCGAACGTGG TCAAGTTCTT GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTTAAAG GTGAAGTTTA CATCCTTTCT AAAGAAGAAG 650 

L5 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA ATTCTACTTC 700 

CGTACAACTG ACGTAACAGG TTCAATCGAA CTTCCAGCAG GAACAGAAAT 750 

GGTTATGCCT GGTGATAACG TTACTATCGA AGTTGAATTG ATTCACCCAA 800 

TCGCCGTAGA ACAAGGTACT AC 822 



20 



35 



2) INFORMATION FOR SEQ ID NO: 210 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agala.ctia.e 

(B) STRAIN: CDCss-1073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 



CGGAGCTATC CTTGTAGTTG CTTCAACTGA TGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTA 100 

40 TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATACGACTTC CCAGGTGATG 200 

ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCACTTGA AGGCGACGAA 250 

AAATACGAAG ACATCATCAT GGAATTGATG AGCACTGTTG ATGAGTACAT 3 00 

TCCAGAACCA GAACGTGATA CTGACAAACC TTTACTTCTT CCAGTTGAAG 350 

45 ATGTATTCTC AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC 400 

CGTGGTACTG TTCGTGTCAA CGACGAAGTT GAAATCGTTG GTATTAAAGA 450 

AGATATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTT 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

50 CAACCCACAC ACTAAATTTA AAGGTGAAGT TTACATCCTT TCTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTACCGTCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTTCAATC GAACTTCCAG CAGGAACAGA 750 

AATGGTTATG CCTGGTGATA ACGTTACTAT CGAAGTTGAA TTGATTCACC 800 

CAATCGCCGT AGAACAAGGT ACTAC 825 

55 

2) INFORMATION FOR SEQ ID NO: 211 
60 ( i ) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 
10 (B) STRAIN: ATCC 33397 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 

GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CTCAAACTCG 50 

15 TGAACACATC CTTCTTT C AC GCCAAGTAGG TGTTAAATAC CTTAT CGTCT 100 

TCATGAATAA AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT 150 

GAAATGGAAA TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA 200 

AATCCCAGTT ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATGAAA 250 

AATATGAAGA CATCATCATG GAATTGATGG ATACTGTTGA TGAATACATT 3 00 

20 CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTTGAAGA 350 

TGTATTCTCA ATTACTGGAC GTGGTACTGT TGCTTCAGGA CGTATCGACC 4 00 

GTGGTACTGT TAAAGTCAAC GACGAAGTTG AAATCGTTGG TATCCGTGAT 4 50 

GAAATC CAAA AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAATT 500 

GGACGAAGGT CTTGCTGGAG ATAACGTAGG GGTTCTTCTT CGTGGTATCC 550 

25 AACGTGACGA AATCGAACGT GGACAAGTTC TTGCTAAACC AGGTTCAATT 600 

CATCCACACA CTAAATTCAA AGGTGAAGTT TACATCCTTA CTAAAGAAGA 650 

AGGTGGACGT CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTACT 7 00 

TCCGTACTAC AGACGTTACA GGTTCAATCG AACTTCCTGC AGGTACTGAA 750 

ATGGTAATGC CTGGTGATAA CGTAACAATC GACGTTGAAT TGATCCACCC 8 00 

3 0 AATTGCCGTA GAACAAGGAA CTACAT 826 



2) INFORMATION FOR SEQ ID NO: 212 



35 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus bovis 

(B) STRAIN: ATCC 33317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 

50 TGGTGCTATC CTTGTAGTAG CTTCTACAGA TGGTCCAATG CCACAAACAC 50 

GTGAACACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTC 100 

TTCATGAACA AAGTTGACCT TGTTGATGAC GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 2 00 

AAATCCCTGT AATCCAAGGT TCAGCTCTTA AAGCCCTTGA AGGTGACACT 250 

55 CACTACGAAG ACATCATCAT GGAATTGATG AACACTGTAG ATGAATACAT 3 00 

TCCAGAACCA AAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCATCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTCAA CGACGAAGTT GAAATCGTTG GTATCCGTGA 450 

CGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

50 TTGATGAAGG TATCGCAGGG GATAACGTTG GTGTTCTTCT TCGTGGTATC 550 
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CAACGTGATG AAATCGAACG TGGTCAAGTT 
CCACCCACAC ACTAAATTCA AAGGTGAAGT 
AAGGTGGACG TCACACTCCA TTCTTCAACA 
TTCCGTACAA CTGACGTTAC AGGTTCAATC 
5 AATGGTAATG CCTGGTGATA ACGTTACTAT 
CAATCGCCGT TGAACAAGGT ACTACAT 



CTTGCTAAAC CAGGTTCAAT 600 
TTACATCCTT ACTAAAGAAG 650 
ACTACCGTCC TCAATTCTAC 700 
GAACTTCCAG CAGGTACTGA 750 
CGACGTTGAA TTGATTCACC 800 

827 



10 2) INFORMATION FOR SEQ ID NO: 213 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus euiginosus (deposited as 
Streptococcus constellatus) 

(B) STRAIN: ATCC 27823 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 

GCTATCCTCG TAGTAGCTTC AACTGATGGA CCAATGCCTC AAACTCGTGA 50 

ACATATCCTT CTTTCACGTC AAGTAGGTGT TAAATACCTT ATCGTCTTCA 100 

TGAACAAAGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

30 ATGGAAATCC GTGACCTTCT TTCAGAATAC GATTTCCCAG GTGATGAAAT 2 00 

CCCAGTTATC CAAGGTTCAG CTCTTAAAGC TCTTGAAGGT GATGAAAAAT 2 50 

ATGAAGACAT CATCATGGAA TTGATGGATA CTGTTGATGA ATACATTCCA 3 00 

GAACCAGAAC GTGACACTGA CAAACCACTT CTTCTTCCAG TCGAAGATGT 350 

ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGAC CGTG 4 00 

35 GTACTGTTAA AGTCAATGAT GAAGTTGAAA TTGTTGGTAT TCGTGACGAA 4 50 

ATCCAAAAAG CAGTTGTTAC TGGTGTTGAA ATGTTCCGTA AACAATTGGA 500 

CGAAGGTCTT GCTGGAGATA ACGTAGGGGT TCTTCTTCGT GGTATCCAAC 550 

GTGACGAAAT CGAACGTGGA CAAGTTCTTG CTAAAC CAGG TTCAATTCAT 600 

CCACACACTA AATTCAAAGG TGAAGTTTAC ATCCTTACTA AAGAAGAAGG 650 

4 0 TGGACGTCAT ACTCCATTCT TCAACAACTA CCGTCCTCAA TTCTACTTCC 7 00 

GTACTACAGA CGTTACAGGT TCAATCGAAC TTCCTGCAGG TACTGAAATG 750 

GTAATGCCTG GTGATAACGT AACAATTGAT GTTGAGTTGA TCCACCCAAT 8 00 

TGCCGTAGAA CAAGGAACTA C 821 



45 



2) INFORMATION FOR SEQ ID NO: 214 



{ i ) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cricetus 

(B) STRAIN: ATCC 19642 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214 

GCTATCCTTG TAGTAGCTTC TACAGACGGA CCAATGCCAC AAACTCGTGA 50 

ACACATCTTG CTTTCACGCC AAGTTGGTGT TAAGAGCCTT ATCGTCTTCA 100 

5 TGAACAAGGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

ATGGAAATCC GTGATCTTCT TTCAGAATAC GATTTCCCAG GTGATGATAT 2 00 

CCCTGTTGTT CAAGGTTCAG CTCTTAAAGC CCTTGAAGGT GATACAGCTG 250 

CCGAAGACAA GATCATGGAA TTGATGGACA TCGTTGATGA CTACATTCCA 3 00 

GAACCAAAAC GTGATACTGA TAAGCCATTG CTTCTTCCAG TCGAAGACGT 350 

10 ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 400 

GTACTGTTAA GGTCAATGAC GAAGTTGAAA TCGTTGGTAT CAAGGACGAA 450 

ATCCAAAAAG CGGTTGTTAC CGGAGTTGAA ATGTTCCGTA AACAATTGGA 500 

TGAAGGTCTT GCAGGGGATA ACGTTGGTGT GCTTCTTCGT GGTATCCAAC 550 

GTGATGAAAT CGAACGTGGT CAAGTATTGG CTGCACCTGG TTCAATCCAT 600 

15 CCACACACTA AATTCAAGGG TGAAGTTTAC ATCCTTTCTA AAGATGAAGG 650 

TGGACGTCAC ACTCCATTCT TCAACAACTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTAACTGGT TCAATCGAAT TGCCAGCAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACGT TACTATCGAC GTTGAATTGA TCCACCCAAT 800 

CGCTGTTGAA AAAGGTACTA C 821 

20 

2) INFORMATION FOR SEQ ID NO: 215 

25 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cri status 
35 (B) STRAIN: ATCC 51100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 

TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACT CGTGAGC 50 

40 ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAGATCG ACTTGGTTGA TGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATC CGT GACCTCTTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 2 00 

CAGTTATCCA AGGTTCAGCT CTTAAAGCTC TTGAAGGTGA TACTAAGTAC 250 

GAAGACATCA TCATGGAATT GATGAACACT GTTGATGAGT ACATCCCAGA 300 

45 ACCAGAACGT GATACTGACA AACCTCTTCT TCTTCCAGTC GAAGACGTAT 3 50 

TCTCAATCAC TGGTCGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 400 

ACTGTTCGTG TCAACGATGA AATCGAAATC GTTGGTATCA AAGAAGAAAT 4 50 

CCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAGCTTGACG 500 

AAGGTCTTGC AGGGGACAAC GTAGGTGTAC TTCTTCGTGG TATCCAACGT 550 

50 GATGAAATCG AACGTGGTCA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAGGGTG AAGTTTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAGTT CTACTTCCGT 7 00 

ACAACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT ' 750 

AATGCCTGGT GATAACGTAA CTATCGACGT TGAGTTGATC CACCCAATCG 800 

55 CCGTTGAACA AGGTACTCCT T 821 



60 



2 ) INFORMATION FOR SEQ ID NO: 216 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 792 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus downei 

(B) STRAIN: ATCC 33748 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 



15 AGTAGCTTCT 
TTTCACGTCA 
GACTTGGTTG 
TGACCTGCTT 
AAGGTTCAGC 

2 0 ATCATGGAAT 
TGATACTGAT 
CTGGACGTGG 
GTCAACGACG 
AGTTGTTACC 

2 5 CAGGGGATAA 

GAACGTGGTC 
GTTTAAAGGT 
CTCCATTCTT 
GTAACTGGTT 

3 0 TGATAACGTT 



ACTGATGGAC 
GGTTGGTGTT 
ACGATGAAGA 
TCAGAATACG 
TCTTAAGGCT 
TGATGGACAT 
AAGCCTTTGC 
TACTGTAGCT 
AAGTTGAAAT 
GGAGTTGAAA 
CGTTGGTGTG 
AAGTGTTGGC 
GAAGTTTACA 
TAACAACTAC 
CAATCGAATT 
ACTATCGACG 



CAATGCCACA 
AAGAACCTTA 
ATTGCTTGAA 
ATTTCCCAGG 
CTTGAAGGTG 
CGTTGATGAC 
TTCTTCCAGT 
TCAGGACGTA 
CGTTGGTATC 
TGTTCCGTAA 
CTTCTTCGTG 
TGCGCCTGGT 
TCCTTTCTAA 
CGTCCACAGT 
GCCAGCGGGT 
TTGAATTGAT 



AACTCGTGAA 
TCGTCTTCAT 
TTGGTTGAAA 
TGATGATATC 
ATACAGCTGC 
TACATTCCAG 
CGAAGATGTA 
TCGACCGTGG 
AAGGACGAAA 
ACAATTGGAC 
GTATCCAACG 
TCGATTCACC 
AGAAGAAGGT 
TCTACTTCCG 
ACTGAAATGG 
CCACCCAATT 



CACATCTTGC 50 

GAACAAGGTT 100 

TGGAAATCCG 150 

CCTGTTGTTC 2 00 

CGAAGACAAG 250 

AACCAAAACG 300 

TTCTCAATCA 350 

TACTGTTAAG 4 00 

TCCAAAAAGC 450 

GAAGGTCTTG 500 

TGATGAAATC 550 

CACACACTAA 600 

GGACGTCATA 650 

TACAACTGAC 70 0 

TTATGCCTGG 750 

GC 792 



2) INFORMATION FOR SEQ ID NO: 217 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus dysgalactiae 

(B) STRAIN: ATCC 43 078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

50 GTAGTTGCTT CAACAGACGG ACCAATGCCA CAAACTCGTG AGCACATCCT 50 

CCTTTCACGT CAGGTTGGTG TTAAACACCT TATCGTGTTC ATGAACAAAA 100 

TTGACCTTGT TGACGATGAA GAATTGCTTG AATTGGTTGA AATGGAAATC 150 

CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC TTCCAGTTAT 2 00 

CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG CGACACTAAA TTTGAAGACA 250 

55 TCATCATGGA ATTGATGGAT ACTGTTGATT CATACATTCC AGAACCAGAA 300 

CGTGACACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG TATTCTCAAT 350 

CACAGGTCGT GGTACAGTTG CTTCAGGACG TATCGACCGT GGTACTGTTC 400 

GTGTCAACGA CGAAATCGAA ATCGTTGGTA TCAAAGAAGA AACTAAAAAA 450 

GCTGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG ACGAAGGTCT 500 

50 TGCAGGAGAC AACGTAGGTA TCCTTCTTCG TGGTGTTCAA CGTGACGAAA 550 
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AAATTCAAAG GTGAAGTATA TATCCTTTCT 
CACTCCATTC TTCAACAACT ATCGTCCACA 
ACGTAACAGG TTCAATCGAA CTTCCAGCTG 
5 GGTGATAACG TGACAATCAA CGTTGAGTTG 



PCT/CA00/01150 

GTTCAATCAA CCCACACACT 600 

AAAGACGAAG GTGGACGTCA 650 

ATTCTACTTC CGTACAACTG 700 

GTACAGAAAT GGTTATGCCT 750 

ATCCACCCAA TCGCC 795 



2 ) INFORMATION FOR SEQ ID NO: 218 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Streptococcus egui subsp . egui 

(B) STRAIN: ATCC 9528 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 

25 CGGAGCTATC CTTGTAGTTG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAGGTTG GTGTTAAGCA CCTTATCGTG 100 

TTCATGAACA AGGTTGACCT TGTTGACGAT GAAGAATTGC TTGAGCTTGT 150 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 200 

ACCTTCCAGT TATCCAAGGT TCAGCGCTTA AGGCTCTTGA AGGCGACAGC 250 

30 AAATACGAAG ATATCATCAT GGAATTGATG GATACTGTTG ATTCATACAT 300 

TC CAGAACCA GAACGTGACA CAGACAAGCC ATTGCTTCTT CCAGTCGAGG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 400 

CGCGGTACTG TTCGTGTTAA CGACGAAATC GAAATCGTTG GTATCAGAGA 450 

CGAGATCAAA AAAGCAGTTG TTACTGGTGT CGAAATGTTC CGTAAACAGC 500 

35 TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT ATTGCTAAGC CAGGTTCTAT 600 

CAACCCACAC ACTAAATTTA AAGGTGAAGT ATATATCCTT ACTAAAGAAG 650 

AAGGTGGACG TCACACACCA TTCTTCAACA ACTATCGTCC ACAATTCTAC 700 

TTCCGTACTA CTGACGTAAC AGGTTCAATC GAGCTTCCAG CAGGTACAGA 750 

40 AATGGTTATG CCTGGTGATA ACGTGACTAT TGACGTTGAG TTGATCCACC 800 

CAATCGCCGT AGAACAAGGT ACTACATT 828 



45 2 ) INFORMATION FOR SEQ ID NO: 219 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ferus 

(B) STRAIN: ATCC 33477 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 
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CGGTGCAATC CTTGTAGTAG CTTCTACAGA 
GTGAGCACAT CCTTCTTTCA CGTCAGGTAG 
TTCATGAACA AAGTTGACTT GGTTGACGAT 
TGAAATGGAA ATCCGTGACC TGCTTTCAGA 
5 ACCTTCCAGT TATCCAAGGT TCAGCTCTTA 
GCTCAAGAAG ATGTTATCAT GGAATTGATG 
CCCAGAACCA GAACGTGATA CTGACAAACC 
ATGTATTCTC AATCACAGGT CGTGGTACTG 
CGTGGTACTG TAAGAGTCAA CGATGAAGTT 

10 CGAAATCACT AAAGCAGTTG TTACCGGTGT 
TGGACGAAGG TCTTGCTGGT GATAACGTTG 
CAACGTGATG AAATCGAACG TGGTCAAGTA 
CAACCCACAC ACTAAATTTA AAGGTGAAGT 
AAGGTGGACG TCATACACCA TTC TTCAACA 

15 TTCCGTACAA CTGACGTAAC TGGTTCAATC 
AATGGTTATG CCTGGTGATA ACGTG AC TAT 
CAATCGCCGT TGAACAAGGT ACTAC 



TGGACCAATG CCACAAACTC 50 

GTGTTAAACA CCTTATCGTC 100 

GAAGAATTGC TTGAATTGGT 150 

ATATGATTTC CCAGGTGATG 2 00 

AAGCGCTTGA AGGTGATACT 250 

AAAACCGTTG ATGAGTACAT 3 00 

ATTGCTTCTT CCAGTCGAAG 350 

TAGCTTCAGG ACGTATCGAT 400 

GAAATCGTTG GTATCAAAGA 450 

TGAAATGTTC CGTAAACAAT 500 

GTGTGCTTCT CCGTGGTGTG 550 

TTGGCTAAAC CAGGTTCAAT 600 

TTACATCCTT ACTAAAGAAG 650 

ACTACCGTCC ACAGTTCTAC 700 

GAATTGCCAG CAGGTACTGA 750 

CGACGTTGAA TTGATCCACC 800 

825 



20 



30 



55 



2) INFORMATION FOR SEQ ID NO: 22 0 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 6 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: ATCC 10558 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220 

CGGAGCTATC CTTGTAGTAG CTTCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CCTTCTCTCA CGCCAAGTTG GTGTTAAACA CTTGATCGTG 100 

TTCATGAACA AAGTTGACTT GGTTGACGAT GAAGAATTGC TTGAGTTGGT 150 

4 0 TGAAATGGAA ATCCGTGACC TCTTGTCAGA ATACGACTTC CCAGGTGACG 2 00 

ATCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCTCTTGA AGGTGACTCT 2 50 

AAATATGAAG ATATCATCAT GGAATTGATG AACACTGTTG ATGAGTACAT 3 00 

CCCAGAACCA GAACGCGACA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC 4 00 

45 CGTGGTATCG TTAAAGTCAA TGACGAAATC GAAATCGTTG GTATCAAAGA 450 

AGAAATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTGCTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG TGGACAAGTT ATTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTATATCCTT ACTAAAGAAG 650 

50 AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700 

TTCCGTACAA CTGACGTTAC AGGTTCAATC GAACTTCCAG CAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGACGTTGAG TTGATCCACC 800 

CAATCGCCGT TGAACAAGGT ACTACT 826 



2) INFORMATION FOR SEQ ID NO: 221 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 799 bases 
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WO 01/23604 PCT/CA00/01 150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 



TGTAGTAGCT TCAACTGACG GACCAATGCC TCAAACTCGT GAACATATCC 50 

TTCTTTCACG TCAAGTAGGT GTTAAATACC TTATTGTCTT CATGAACAAA 100 

15 GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT " CTTTCAGAAT ACGATTTCCC AGGTGATGAT ATTCCAGTAA 2 00 

TCCAAGGTTC AGCACTTAAA GCTCTTGAAG GTGATGAAAA ATATGAAGAC 250 

ATCATCATGG AATTGATGAA TACTGTTGAT GAATATATTC CAGAACCAGA 3 00 

ACGTGATACT GACAAACCAT TGCTTCTTCC AGTCGAAGAT GTATT CTCAA 3 50 

20 TCACTGGACG TGGTACTGTT GCTTCAGGAC GTATCGACCG TGGTACTGTT 4 00 

AAAGTCAACG ATGAAGTTGA AATCGTTGGT ATCCGCGAGG AAATCCAAAA 45 0 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAATTG GACGAAGGTC 50 0 

TTGCTGGAGA TAACGTAGGG GTTCTTCTTC GTGGTATCCA ACGTGACGAA 550 

ATTGAACGTG GACAAGTTCT TGCTAAACCA GGTTCAATTC ATCCACACAC 600 

25 TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA GGTGGACGTC 65 0 

ATACTCCATT CTTCAACAAC TACCGTCCTC AATTCTACTT C CGTACTACA 70 0 

GACGTTACAG GTTCAATCGA ACTTCCTGCA GGTACTGAAA TGGTAATGCC 750 

TGGTGATAAC GTAACAATTG ATGTTGAGTT GATCCACCCA ATTGCCGTA 7 99 



2) INFORMATION FOR SEQ ID NO: 222 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus macaoae 

(B) STRAIN: ATCC 35911 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 



TGGTGCTATT CTTGTAGTAG CTTCAACTGA CGGTCCAATG CCTCAAACGC 50 

GTGAACATAT CCTTCTTTCA CGCCAAGTAG GTGTTAAAAA CCTTATTGTT 100 

50 TTCATGAATA AAGTTGACTT AGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTACAGA ATATGATTTC CCAGGCGATG 200 

AACTTCCAGT TATCCAAGGT TCAGCACTTA AAGCTCTTGA AGGTGATACT 250 

AAGTACGAAG ATATTATCAT GGAATTGTTG GATACTGTAG ATGATTACAT 3 00 

CCCAGAACCA CAACGTGATA CTGACAAGCC ATTGCTTCTT CCAGTCGAAG 350 

55 ATGTTTTCTC TATTACTGGA CGTGGTACTG TTGCTTCAGG ACGTATTGAC 400 

CGTGGTACTG TTAAGGTTAA TGATGAAGTT GAAATCGTTG GTATTCGTGA 450 

CGATATTCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTCG GTGTCCTTCT TCGTGGTATC 550 

CAACGTGATG AAATTGAACG CGGTCAAGTT CTTGCTAAAC CAGGATCAAT 600 

50 TCATCCACAT ACTAAATTCA AAGGTGAAGT TTATATTCTT ACTAAAGAAG 650 
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PCT/CA00/01150 

ACTACCGTCC . ACAGTTCTAC 70 0 

GATTTGCCAG CAGGTACTGA 750 
TGATGTTGAA CTGATCCACC 800 

825 



2) INFORMATION FOR SEQ ID NO: 223 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii (deposited as 
Streptococcus mitis) 

(B) STRAIN: ATCC 333 99 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 2 00 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 3 00 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAA.CGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTAGAAC AAGGTACTAC AT 822 



2 ) INFORMATION FOR SEQ ID NO: 224 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: ATCC 25175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 
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CCCTGGTGCT ATCCTTGTAG TAGCTTCAAC TGATGGACCA ATGCCACAAA 50 

CTCGTGAACA CATTCTTCTT TCACGTCAAG TTGGTGTTAA ATACCTCATT 100 

GTCTTCATGA ATAAAGTTGA TTTGGTTGAC GATGAAGAAT TGCTTGAATT 150 

GGTTGAAATG GAAATCCGTG ATCTTCTTTC AGAATATGAT TTCCCAGGTG 200 

5 ATGATATTCC AGTTATTCAA GGTTCAGCTC TTAAAGCTCT TGAAGGCGAT 250 

ACTGCTCAAG AAGATATCAT CATGGAATTA ATGCATACTG TTGATGACTA 3 00 

CATTCCAGAT CCAGAACGTG ATACTGACAA GCCGCTCCTT CTTCCAGTCG 350 

AAGATGTTTT CTCAATCACT GGTCGTGGTA CTGTTGCTTC AGGACGTATT 400 

GATCGTGGTA CTGTTAAAGT TAACGATGAA GTTGAAATCG TTGGTATCCG 450 

10 TGATGACATT CAAAAAGCTG TTGTTACTGG TGTTGAAATG TTCCGTAAAC 500 

AATTGGATGA AGGTATTGCA GGGGATAATG TTGGTGTTCT CCTTCGTGGT 550 

ATCCAACGTG ATGAAATCGA ACGTGGTCAA GTTCTTGCTA AACCAGGTTC 600 

AATTCACCCA CATACTAAAT TCAAAGGTGA AGTTTATATC CTTACTAAAG 650 

AGGAAGGTGG ACGTCATACA CCATTCTTCA ATAACTATCG TCCACAATTC 700 

15 TACTTCCGTA CAACTGACGT AACTGGTTCA ATTGAGTTGC CAGCAGGTAC 750 

TGAAATGGTT ATGCCTGGTG ATAACGTTAC TATTGACGTT GAATTGATCC 800 

ATCCAATCGC TGTTGAACAA GGTACTA 827 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 225 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 824 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus parasanguinis 

(B) STRAIN: ATCC 15912 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 

AGCTATCCTT GTAGTAGCTT CAACTGACGG ACCAATGCCA CAAACACGTG 50 

AACACATCCT TCTTTCACGT CAGGTTGGTG TTAAACACTT GATCGTCTTC 100 

ATGAACAAAG TTGACTTGGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

4 0 AATGGAAATC CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG TGACTCTAAA 250 

TATGAAGATA TCATCATGGA ATTGATGGAT ACTGTTGATG AGTACATCCC 3 00 

AGAACCAGAA CGCGATACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400 

45 GGTGTTGTTC GTGTCAATGA TGAAATCGAA ATCGTTGGTA TCAAAGAAGA 450 

AATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAT AACGTTGGTG TGCTTCTTCG TGGTATCCAA 550 

CGTGATGAAA TCGAACGTGG ACAAGTTATC GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTCAAAG GTGAAGTTTA CATCCTTACT AAAGAAGAAG 650 

50 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACTGG ATCTATCGAA CTTCCACCAG GAACTGAAAT 750 

GGTAATGCCT GGTGATAACG TGACTATCGA CGTTGAGTTG ATCCACCCAA 800 

TCGCCGTTGA ACAAGGTACT ACAT 824 



2) INFORMATION FOR SEQ ID NO: 22 6 

( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 824 bases 
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<B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ratti 

(B) STRAIN: ATCC 1964 5 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA TGGACCAATG CCGCAAACTC 50 

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100 

15 TTCATGAACA AGGTTGACTT GGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTTCAGA ATACGATTTC CCAGGTGATG 200 

ACATTCCAGT TATCCAAGGT TCAGCCCTTA AAGCTCTTGA AGGTGACACT 250 

GAACAAGAAG ATGTTATCAT GGAATTGATG AAAACAGTTG ATGAGTACAT 300 

CCCAGATCCA GAACGCGATA CTGATAAGCC ATTGCTTCTT CCAGTCGAAG 350 

20 ACGTGTTCTC AATCACTGGA CGTGGTACTG TTGCATCAGG ACGTATCGAC 40 0 

CGTGGTACTG TTAAAGTCAA TGACGAAGTT GAAATCGTTG GTATCCGTGA 450 

TGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTTG GTGTACTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG CGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

25 TCATCCGCAT ACTAAATTTA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650 

AAGGCGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTTCAATC GAATTGCCAG CAGGTACTGA 750 

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 800 

CAATCGCTGT TGAACAAGGT ACTA 824 

30 



2 ) INFORMATION FOR SEQ ID NO: 227 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sanguinis 
15 (B) STRAIN: ATCC 10556 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 7 

TGTAGTAGCT TCAACTGACG GACCAATGCC ACAAACTCGT GAGCACATCT 50 

50 TGCTTTCACG TCAGGTTGGT GTTAAACACT TGATCGTCTT CATGAACAAA 100 

GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGACCTC TTGTCAGAAT ACGACTTCCC AGGTGACGAT CTTCCAGTTA 2 00 

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA ATATGAAGAC 250 

ATCATCATGG AATTGATGGA CACTGTTGAT GAGTACATCC CAGAACCAGA 300 

55 ACGCGATACT GACAAGCCAT TGCTTCTTCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACAGTT GCTTCAGGAC GTATCGACCG TGGTATCGTT 400 

AAAGTCAACG ACGAAATCGA AATCGTTGGT ATCAAAGAAG AAATCCAAAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAGCTT GACGAAGGTC 500 

TTGCAGGGGA CAACGTAGGT GTGCTTCTCC GTGGTATCCA ACGTGATGAA 550 

50 ATCGAACGTG GACAAGTTAT CGCTAAACCA GGTTCAATCA ACCCACACAC 600 
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TAAATTCAAG GGTGAAGTTT ATATCCTTAC 
ACACTCCATT CTTCAACAAC TACCGTCCAC 
GACGTTACAG GTTCAATCGA ACTTCCAGCA 
TGGTGATAAC GTAACAATCG ACGTTGAGTT 



TAAAGAAGAA GGCGGACGTC 650 

AGTTCTACTT CCGTACAACT 700 

GGTACTGAAA TGGTAATGCC 750 

GATCCACCCA ATCGC 795 



2) INFORMATION FOR SEQ ID NO: 228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sobrinus 

(B) STRAIN: ATCC 33478 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 

TGTAGTAGCT TCTACTGACG GACCAATGCC ACAAACTCGT GAACACATCT 50 

TGCTTTCACG CCAAGTTGGT GTTAAGAACC TCATCGTCTT CATGAACAAG 100 

GTTGACTTGG TTGATGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGACGAC ATTCCTGTTG 200 

TTCAAGGTTC AGCTCTTAAG GCTCTTGAAG GTGATACAGC TGCCGAAGAC 2 50 

AAGATTATGG AATTGATGGA CATCGTTGAT GATTACATTC CAGAACCAAA 300 

ACGCGATACT GATAAGCCAT TGCTTCTCCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACTGTT GCTTCAGGAC GTATTGACCG TGGTACTGTT 400 

AAGGTTAACG ACGAAGTTGA AATCGTTGGT ATCCGTGACG ATATCCAAAA 450 

AGCAGTTGTT ACTGGAGTTG AAATGTTCCG TAAGCAATTG GACGAAGGTC 500 

TTGCTGGAGA TAACGTTGGT GTGCTTCTTC GTGGTATCCA ACGTGATGAA 550 

ATTGAACGTG GTCAAGTATT GGCTGCACCT GGTTCAATCC ACCCACACAC 600 

TAAGTTCAAG GGTGAAGTTT ACATCCTTTC TAAAGATGAA GGTGGACGTC 650 

ACACTCCATT CTTCAACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700 

GACGTAACTG GTTCAATCGA ATTGCCAGCA GGTACTGAAA TGGTTATGCC 750 

TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGC 795 



2) INFORMATION FOR SEQ ID NO: 229 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC 43765 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 

TGTAGTAGCT TCAACTGACG GTCCAATGCC ACAAACTCGT GAGCACATCC 50 
TTCTTTCACG TCAGGTTGGT GTTAAACACC TTATCGTCTT CATGAACAAA 100 
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GTTGACTTGG TTGACGATGA AGAATTGCTT 
CCGTGACCTT CTTTCAGAAT ACGATTTCCC 
TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG 
ATCGTTATGG AATTGATGAA CACTGTTGAT 
5 ACGCGACACT GACAAACCAT TGTTGCTTCC 
TCAC TGGTCG TGGTACTGTA GCTTCAGGAC 
CGTGTCAACG ACGAAATCGA AATCGTTGGT 
AGCAGTTGTT ACTGGTGTTG AAATGTTCCG 
TTGCCGGCGA TAACGTTGGT GTGCTTCTTC 
10 AT CGAACGTG GTCAAGTTAT CTCTAAACCA 
TAAATTCAAA GGTGAAGTTT ACATCCTTAC 
ACACTCCATT CTTCGACAAC TACCGTCCAC 
GACGTAACTG GTTCAATCAA ATTGCCAGAA 
TGGTGATAAC GTTACTATCG ACGTTGAATT 

15 



GAGTTGGTTG AAATGGAAAT 150 

AGGTGATGAT CTTCCAGTTA 2 00 

GTGACTCTAA GTACGAAGAC 250 

GAGTACATTC CAGAACCAGA 3 00 

AGTCGAGGAC GTATTCTCAA 350 

GTATCGACCG TGGTACTGTT 4 00 

CTTCAAGAAG AAAAATCTAA 4 50 

TAAACAACTT GACGAAGGTC 500 

GTGGTGTACA ACGTGATGAA 550 

GGTTCTATCA ACCCACACAC 600 

TAAAGAAGAA GGTGGACGTC 650 

AGTTCTACTT CCGTACAACT 700 

GGTACTGAAA TGGTAATGCC 750 

GATCCACCCA ATCGCCG 7 97 



2) INFORMATION FOR SEQ ID NO : 23 0 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uteris 
30 (B) STRAIN: ATCC 19436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 

TTGTTGTTGC ATCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GCCAAGTTGG TGTTAAACAC CTTATCGTTT TCATGAACAA 100 

AATCGACCTT GTTGACGATG AAGAATTGCT TGAATTAGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTACCAGTT 2 00 

ATCCAAGGTT CAGCT CTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GAATTGATGA AAACTGTTGA TGAGTATATT CCAGAACCAG 300 

AACGTGATAC AGACAAACCA TTACTTCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACAGGTC GTGGTACTGT AGCTTCAGGA CGTATCGATC GTGGTACTGT 4 00 

TCGTGTCAAC GACGAAATTG AAATCGTTGG TATCAAAGAA GAAACTAAAA 450 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCAGGAG ATAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA 550 

AATCGAACGT GGACAAGTTA TTGCTAAACC AGGTTCAATC AACCCACACA 600 

CTAAATTCAA AGGTGAAGTT TACATCC TTT CTAAAGATGA AGGTGGACGT 650 

CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTATT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCAGC TGGTACTGAA ATGGTAATGC 7 50 

CTGGTGATAA CGTGACAATC AGCGTTGAGT TGATCCACCC AAT 793 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 231 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doubl e 

(D) TOPOLOGY: Linear 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus vestibularis 
5 (B) STRAIN: ATCC 49124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 

TTGTAGTAGC ATCTACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

10 CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA TATTCCAGTT 2 00 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GACTTGATGA ACACTGTTGA CGAATACATT CCAGAACCAG 3 00 

15 AACGTGACAC TGACAAACCA TTGTTGCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACTGGTC GTGGTACTGT TGCTTCAGGA CGTATCGACC GTGGTGTTGT 4 00 

TCGTGTTAAT GACGAAGTTG AAATCGTTGG TCTTAAAGAA GAAATCCAAA 450 

AAGCAGTTGT TACTGGTGTA GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

ATTGCCGGAG ATAACGTCGG TGTCCTTCTT CGTGGTATCC AACGTGATGA 550 

20 AATTGAACGT GGTCAAGTAT TGGCTGCACC TGGTTCAATC AACCCACACA 600 

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGAAGA AGGTGGACGT 650 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAGTTCTACT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCTGC AGGTACTGAA ATGGTTATGC 750 

CTGGTGATAA CGTGACTATC GACGTTGAGT TGATCCACCC AATCGCCG 798 

25 

2) INFORMATION FOR SEQ ID NO: 232 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella. ptyseos 
40 (B) STRAIN: ATCC 33301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACCCG 50 

45 TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTTCCTTAC ATCATCGTGT 100 

TCCTGAACAA ATGTGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TCCGTGACCT GCTGTCACAG TACGACTTCC CGGGTGACGA 2 00 

CACGCCAATC GTTCGCGGTT CAGCGCTGAA AGCACTGGAA GGTGAAGGCG 250 

AGTGGGAAGA GAAGATTCTG GAGCTGGCTG GCTTCCTGGA TTCTTACATC 300 

50 CCTGAGCCAG AGCGTGCTAT CGATCAGCCG TTCCTGCTGC CAATCGAAGA 3 50 

CGTATTCTCA ATCTCCGGTC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 4 00 

GCGGGATCAT CAAAGTCGGT GAAGAAGTTG AGATCGTTGG TATCAAAGAT 450 

ACTGCGAAAT CAACCTGTAC CGGTGTTGAA ATGTTCCGTA AACTGCTGGA 500 

CCAGGGTCAG GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550 

55 GTGAAGAGAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 

CCACACACCC AGTTCGAGTC AGAAGTTTAT ATTCTGTCTA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGA ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCTG GTGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCAAT 800 

50 CGCGATGGAC GATGGTCTGC GTTTCGCAA 829 

139 
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2) INFORMATION FOR SEQ ID NO: 23 3 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trabulsiella guamensis 

(B) STRAIN: ATCC 49490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

20 GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGATGA 20 0 

CACGCCGATC GTACGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

25 AGTGGGAAGC GAAAATCATC GAACTGGCAG GTTTCCTGGA TTCTTACATT 300 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCTGGTC GTGGTAC CGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTGGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CAACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

3 0 CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

35 GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 



40 2) INFORMATION FOR SEQ ID NO: 234 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 



CGGCGCTATC TTGGTTGTAT CCGCAGCTGA CGGCCCTATG CCTCAAACTC 50 

GCGAACACAT CTTGTTGGCT CGCCAAGTTG GTGTTCCTGC AATCGTAGTA 100 

TTCTTGAACA AAGCTGACAT GGTTGACGAT GAAGAATTGA TCGAATTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTTC CTACGAATTC CCTGGCGACG 2 00 

60 AAGTACCTAT CGTTGTAGGT TCCGCGTTGA AAGCTTTGGA AGGCGATGCT 250 

140 
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CAATATGTAG CTAAAATTGA CGAATTGATG GACGCTGTAG ACTCCTACAT 300 

CCCAACACCA GTTCGTGACA CTGATAAACC ATTCTTGATG CCTGTGGAAG 350 

ATGTTTTCAC AATCACTGGT CGTGGTACAG TAGCAACTGG CCGTGTTGAA 400 

CGTGGTCAAG TAAACGTTGG TGATACTGTT GAAGTAGTAG GCTTGAAAGA 4 50 

5 AAAAGCTGAA CAATACGTAG TAACAGGTCT TGAAATGTTC CGTAAAGTGT 500 

TGGATTCTGC AGTAGCAGGT GACAACGTAG GTGCATTGCT TCGTGGTGTT 550 

GATCGTAAAG ACATYGAACG TGGTCAAGTA TTGGCTAAAC CAGGTTCCAT 600 

CAACCCACAY ACAAAATTCA AAGCAGAAGT ATACGTATTG ACTAAAGAAG 650 

AAGGTGGTCG TCATACTCCA TTCTTCTCCA ACTACCGTCC ACAATTCTAC 700 

10 TTCCGTACAA CAGACGTAAC AGGTGTTGTA AACCTTCCTG AAGGTGTAGA 750 

AATGTGTATG CCTGGCGATA ACGTAACAAT GGAAATCGAA TTGATTACTC 8 00 

CAATCGCTAT CGAAGAAGGT CTTCG 825 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 235 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 825 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia, enterocolitica. 

(B) STRAIN: ATCC 9610 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 

CGCTATCCTG GTTGTTGCTG CAACTGATGG CCCAATGCCA CAGACTCGTG 50 

AGCACATCCT GTTGGGTCGT CAGGTTGGTG TTCCTTACAT CATCGTATTC 100 

ATGAACAAAT GTGACATGGT TGACGATGAA GAGCTGCTAG AACTGGTAGA 150 

35 AATGGAAGTG CGCGATCTTC TGTCTACCTA CGATTTCCCA GGCGATGATA 200 

CGCCAGTTGT TCGTGGTTCC GCGCTGAAAG CATTGGAAGG CGAACCTGAG 250 

TGGGAAGCAA AAATTATCGA ACTGGCTGGC TACCTGGATT CTTACATCCC 300 

AGAACCAGAG CGTGCTATCG ATAAGCCGTT CCTGCTGCCA ATCGAAGACG 3 50 

TATTCTCTAT CTCTGGTCGT GGTACTGTTG TAACGGGTCG TGTAGAGCGC 4 00 

40 GGTATCGTTA AAGTTGGTGA AGAAGTCGAA ATTGTTGGCC TGAAAGATAC 450 

CGTTAAATCT ACTTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGATG 500 

AAGGCCGTGC AGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGATATCG AACGTGGTCA AGTTCTTGCT AAACCAGGTT CGATTAAACC 600 

ACACACCAAA TTTGAATCAG AAGTTTATAT TCTGAGCAAA GATGAAGGTG 650 

45 GTCGCCATAC TCCGTTCTTC AAAGGCTACC GTCCTCAGTT CTACTTCCGT 700 

ACAACTGATG TAACCGGTAC TATTGAACTG CCAGAAGGCG TTGAGATGGT 750 

GATGCCAGGT GATAACATTC AAATGATTGT TAACCTGATT GCTCCTATCG 800 

CAATGGATGA CGGCTTGCGC TTTGC 825 



2 ) INFORMATION FOR SEQ ID NO : 236 



(i) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DN A 

141 
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(vi) ORIGINAL. SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 

GGCGCGATCC TGGTTGTTGC TGCCACTGAT GGCCCGATGC CACAGACTCG 50 

CGAGCACATT CTGTTAGGGC GTCAGGTGGG TGTTCCTTAC ATCCTGGTCT 100 

10 TCCTGAACAA ATGTGACATG GTTGACGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CTGGCGACGA 20 0 

CACTCCAGTT ATCCGTGGTT CTGCGCTGAA AGCGCTGGAk GGCGAAGCTG 2 50 

AGTGGGAAGC AAAAATCATC GAATTGGCTG AGGCGCTGGA TAGCTATATT 300 

CCACAGCCAG AGCGTGCGAT TGATAAACCA TTCCTGCTGC CAATCGAAGA 3 50 

15 CGTATTCTCA ATCTCTGGCC GTGGTACTGT TGTCACCGGT CGTGTAGAGC 400 

GCGGTATCGT TAAAGTCGGC GAAGAAGTCG AAATCGTTGG TATCATTGAT 450 

ACCATCAAGA CTACCTGTAC TGGTGTTGAA ATGTTCCGCA AATTGCTGGA 500 

CGAAGGCCGT GCGGGTGAGA ACGTTGGTGT TCTGCTACGT GGTACTAAAC 550 

GTGATGACGT ACAACGTGGT CAGGTATTGG CAAAACCAGG TTCTATCAAG 600 

20 CCACACACCA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

TGGTCGCCAT ACTCCGTTCT TCAAAGGTTA TCGTCCTCAG TTCTACTTCC 700 

GTACAACTGA CGTGAC CGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTGATGCCAG GTGATAACAT TCAAATGATT GTTAACCTGA TTGCTCCTAT 8 00 

CGCAATGGAT GACGGTCTGC GCTTTGCG 82 8 

25 



2) INFORMATION FOR SEQ ID NO: 23 7 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 
£0 (B) STRAIN: ATCC 29909 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 

CTTGGTTGTC GCTGCAACTG ATGGTCCTAT GCCACAGACT CGCGAGCACA 50 

±5 TCCTGCTAGG TCGTCAGGTG GGTGTTCCTT ACATCCTGGT CTTCCTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAGTTG CTGGAATTGG TAGAAATGGA 150 

AGTCCGCGAA CTTCTGTCTC AATATGATTT CCCTGGCGAT GATACTCCTG 2 00 

TTATCCGTGG TTCAGCGCTG AAGGCGTTGG AAGGCGAGCC TGAATGGGAA 250 

GCAAAAATTA TCGAATTAGC TGAGGCGCTG GATAGTTATA TTCCACAGCC 3 00 

50 AGAGCGCGCG ATTGATAGAC CATTCTTGCT GCCAATCGAA GACGTATTCT 350 

CTATCTCAGG TCGTGGTACA GTCGTCACTG GTCGTGTAGA GCGTGGGATC 400 

GTTAAAGTTG GCGAAGAAGT TGAAATCGTT GGTATTATCG ATTCCATTAG 450 

AACAACATGT ACTGGCGTTG AAATGTTCCG CAAATTGCTG GACGAAGGCC 500 

GCGCGGGTGA GAACGTTGGT GTTCTACTGC GTGGGACTAA ACGTGATGAC 550- 

55 GTACAGCGTG GTCAGGTATT AGCTAAGCCA GGTTCTATCA AGCCACATAC 600 

TAAATTCGAA TCCGAAGTTT ATATTCTGAG CAAAGATGAA GGCGGGCGTC 650 

ACACGCCGTT CTTCAAAGGC TACCGTCCTC AGTTCTACTT CCGTACAACG 700 

GATGTAACCG GTACTATTGA ATTGCCAGAC GGCGTTGAGA TGGTGATGCC 750 

AGGTGATAAC ATTCAAATGA TTGTTAACCT GATTGCACCT ATTGCGATGG 800 

SO ATGATGGTCT GCG 813 
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55 



2) INFORMATION FOR SEQ ID NO: 23 8 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D2 7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 

GGAGCGATCT TGGTTGTTGC TGCAACCGAT GGCCCTATGC CGCAGACTCG 50 

TGAGCATATC CTGCTGGGCC GCCAGGTTGG TGTCCCATAC ATTATTGTCT 100 

TCCTGAACAA ATGTGACATG GTTGACGATG AAGAGTTGCT AGAGTTGGTT 150 

GAAATGGAGG TTCGTGAGCT TCTGTCTCAA TACGATTTCC CAGGCGACGA 200 

CACTCCAGTC ATCCGTGGTT CAGCGTTGAA AGCCCTGGAA GGTGACGCTG 250 

AGTGGGAAGC TAAAATTATC GAGTTGGCAG AAGCTCTGGA TAGCTATATT 3 00 

CCGCAACCAG AACGCGCTAT TGATAGACCA TTCCTATTGC CAATTGAAGA 350 

CGTATTCTCT ATTTCTGGTC GTGGTACTGT AGTTACTGGT CGTGTAGAAC 4 00 

GTGGTATTGT TAAGGTCGGC GAAGAAGTTG AAATCGTTGG TATTATCGAT 450 

ACGATTAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGCA AGCTGCTGGA 500 

TGAAGGCCGT GCTGGTGAAA ATGTTGGTGT TCTGCTGCGT GGTACTAAGC 550 

GTGACGATGT TCAGCGTGGT CAAGTACTGG CGAAACCAGG TTCTATCAAG 600 

CCACACACGA AGTTTGAGTC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

CGGCCGTCAT ACACCGTTCT TCAAGGGCTA CCGTCCTCAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACCATTGAGC TGCCAGAAGG CGTTGAAATG 750 

GTCATGCCTG GTGACAACGT AAACATGGTT GTTAACCTAA TTGCTCCTAT 800 

CGCAATGGAT GATGGTCTGC GCTTCGCAA 829 



2 ) INFORMATION FOR SEQ ID NO : 23 9 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia, pseudotuberculosis 

(B) STRAIN: ATCC 2 9833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239 



TGGAGCGATC TTGGTTGTTG 
GTGAGCATAT CCTGCTGGGC 
TTCCTGAACA AATGTGACAT 
TGAAATGGAG GTTCGTGAGC 
ACACTCCAGT CATCCGTGGT 



•CTGCAACCGA TGGCCCTATG 
CGCCAGGTTG GTGTCCCATA 
GGTTGACGAT GAAGAGTTGC 
TTCTGTCTCA ATACGATTTC 
TCAGCGTTGA AAGCCCTGGA 
143 



CCGCAGACTC 50 

CATTATTGTC 100 

TAGAGTTGGT 150 

CCAGGCGACG 200 

AGGTGACGCT 250 
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10 



GAGTGGGAAG 
TCCGCAACCA 
ACGTATTCTC 
CGCGGTATTG 
TACGATTAAA 
ATGAAGGCCG 
CGTGACGATG 
GCCACACACG 
GCGGCCGTCA 
CGTACAACTG 
GGTCATGCCT 
TCGCAATGGA 



CTAAAATTAT 
GAACGCGCTA 
TATTTCTGGT 
TTAAGGTCGG 
ACAACTTGTA 
TGCTGGTGAA 
TTCAGCGTGG 
AAGTTTGAGT 
TACACCGTTC 
ACGTGACCGG 
GGTGACAACG 
TGATGGT 



CGAGTTGGCA 
TTGATAGACC 
CGTGGTACTG 
CGAAGAAGTT 
CTGGCGTTGA 
AATGTTGGTG 
TCAAGTACTG 
CAGAAGTTTA 
TTCAAGGGCT 
TACCATTGAG 
TAAACATGGT 



GAAGCTCTGG 
ATTCCTATTG 
TAGTTACTGG 
GAAATCGTTG 
AATGTTCCGC 
TTCTGCTGCG 
GCGAAACCAG 
TATTCTGAGC 
ACCGTCCTCA 
CTGC CAGAAG 
TGTTAACCTA 



ATAGCTATAT 3 00 

CCAATTGAAG 350 

TCGTGTAGAA 4 00 

GTATTATCGA 4 50 

AAGCTGCTGG 500 

TGGTACTAAG 550 

GTTCTATCAA 600 

AAAGATGAAG 650 

GTTCTACTTC 700 

GCGTTGAAAT 750 

ATTGCTCCTA 800 

817 



15 



20 



25 



30 



2) INFORMATION FOR SEQ ID NO: 240 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
-(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN : ATCC 43380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240 



TGGCGCGATC 
GCGAGCACAT 
TTCCTGAACA 

35 TGAAATGGAA 
ACACTCCGGT 
GAGTGGGAAG 
TCCACAGCCA 
ACGTATTCTC 

40 CGCGGTATCG 
CACCATTAAA 
ACGAAGGCCG 
CGCGATGACG 
GCCGCATACC 

45 GTGGTCGTCA 
CGTACAACTG 
GGTGATGCCA 
TTGCAATGGA 



CTGGTTGTTG 
CCTGTTGGGT 
AGTGTGACAT 
GTTCGTGAGC 
TATTCGTGGT 
CCAAAATTAT 
GAGCGCGCGA 
TATCTCAGGC 
TCAGAGTGGG 
ACCACTTGTA 
TGCCGGTGAG 
TGCAACGTGG 
AAATTTGAGT 
TACTCCGTTC 
ACGTGACCGG 
GGTGATAACA 
TGACGGTCTA 



CTGCAACTGA 
CGTCAAGTGG 
GGTTGACGAC 
TTCTGTCTCA 
TCCGCGCTGA 
TGAACTTGCT 
TTGATAAACC 
CGTGGAACAG 
CGAAGAAGTT 
CCGGTGTTGA 
AACGTTGGTG 
TCAAGTGTTG 
CAGAAGTTTA 
TTCAAAGGTT 
TACCATCGAA 
TTCAAATGAT 
CGATTTGCA 



TGGCCCAATG 
GTGTTCCTTA 
GAAGAGTTGC 
ATACGATTTC 
AAGCGCTGGA 
GAAGCACTGG 
ATTCTTGCTG 
TTGTTACCGG 
GAAATCGTGG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAACCAG 
TATTCTGAGC 
ACCGTCCACA 
CTGCCAGACG 
TGTTAACCTG 



CCACAGACTC 50 

CATCTTAGTC 100 

TGGAACTGGT 150 

CCTGGCGATG 2 00 

AGGCGAGGCC 250 

ATAGCTACAT 3 00 

CCAATCGAAG 350 

GCGTGTTGAG 4 00 

GTATCATCGA 450 

AAACTGCTGG 500 

CGGTACTAAA 550 

GTTCTATTAA 600 

AAAGATGAAG 650 

GTTCTACTTC 700 

GTGTTGAGAT 750 

ATTGCGCCTA 800 

829 



50 



55 



2) INFORMATION FOR SEQ ID NO: 241 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 (ii) MOLECULE TYPE: Genomic DNA 

144 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 3 5313 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGT C AGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 2 00 

ACACTCCGAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 2 50 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GTTCGTGCTA TCGACCTGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGTGGCACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

GACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAAC TGCTGG 50 0 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GTTCCATCAA 60 0 

20 GCCGCACACC AAATTCGAAT CTGAAGTTTA TATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACTGG TAC CATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGC 804 

25 



2) INFORMATION FOR SEQ ID NO: 242 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Achromobacter xylosoxidans subsp- 
40 denitr if icans 

(B) STRAIN: ATCC 15173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 

15 TCAGTTCCCC CGCGATCACA TGCCCAAGAT CTACGAAGCG CTTACTCTGG 50 

CCGACGAGGG TTCCTCGTTC GCCGAAAAGG GTCTGACGCT GGAAGTGCAG 100 

CAACAGCTGG GCGACGGCGT GGTGCGTACC ATCGCGCTGG GCTCCAGCGA 15 0 

CGGCCTGCGC CGCGGTATGA AGGTCACCGG TACGGGCGCG CCGATCTCGG 200 

TGCCGGTCGG CACCGGCACG CTGGGCCGCA TCATGGACGT GCTGGGTCGT 250 

50 CCCATCGACG AAGCCGGCCC GATCCAGCAC GAAGAAAAGC GTGGCATTCA 300 

CCAGCCGGCT CCCCGTTTCG ACGAACTGTC GCCGTCGGTG GAACTGCTGG 350 

AAACCGGCAT CAAGGTTATT GACCTGGTCT GCCCGTTCGC CAAGGGCGGC 400 

AAGGTCGGCC TGTTCGGCGG CGCCGGCGTG GGCAAGACCG TCAACATGAT 450 

GGAACTGATC AACAACATCG CCAAGCAGCA CAGCGGCTTG TCGGTGTTCG 50 0 

55 CCGGCGTGGG CGAGCGTACC CGCGAAGGCA ACGACTTCTA CCACGAAATG 550 

GAAGAGTCGA ACGTTCTGGA CAAGGTTGCG ATGGTGTTCG GTCAGATGAA 600 

CGAACCCCCG GGCAACCGTC TGCGCGTGGC GCTGACCGGC CTGACCATGG 650 

CCGAGAAGTT CCGCGACGAA GGCCGCGACA TCCTGTTCTT CGTGGACAAC 700 

ATCTACCGCT ACACCCTGGC CGGTACGGAA GTGTCCGCGC TGCTGGGCCG 750 

SO TATGCCGTCG GCAGTGGGCT ACCAGCCCAC GCTGGCCGAA GAAATGGGCA 800 
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2 ) INFORMATION FOR SEQ ID NO: 243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 

TGAAACTACT TTAGAAGTTC AGCAACAACT TGGTGATGGT GTTGTTCGTA 50 

CCAT CGCAAT GGGTTCTACA GAAGGTCTTA AACGTGGTCT TACTGTAACT 100 

AGCACAAACG CACCGATCTC TGTTCCAGTT GGTACAGCCA CTCTTGGCCG 150 

TATCATGGAC GTTTTAGGTC GTCCTATCGA TGAAGCAGGT CCTGTTGCGA 2 00 

CTGAAGAACG TTTGCCGATT CACCGTCAAG CGCCTTCTTA TGCTGAACAA 250 

GCAGCTTCTA CTGACCTTTT AGAAACTGGT ATTAAAGTCA TCGACTTACT 300 

TTGCCCGTTT GCGAAAGGTG GTAAAGTTGG TTTATTCGGT GGTGCTGGTG 350 

TTGGTAAAAC CGTTAACATG ATGGAATTGA TCAACAACAT CGCGAAAGCA 4 00 

CACTCAGGTT TATCTGTGTT TGCTGGTGTT GGTGAGCGTA CTCGTGAAGG 450 

TAATGACTTC TATCACGAAA TGAAAGATTC TAACGTTCTT GACAAAGTAG 500 

CAATGGTCTA CGGTCAGATG AACGAGCCAC CAGGTAACCG TTTACGCGTA 550 

GCGTTAACTG GTTTGACTAT GGCTGAATAC TTCCGTGATG AAAAAGACGA 600 

AAACGGTAAA GGTCGTGACG TATTATTATT CGTCGACAAC ATCTACCGTT 650 

ATACACTTGC AGGTACTGAA GTATCAGCAT TGTTAGGTCG TATGCCATCT 700 

GCGGTAGGTT ACCAACCTAC ACTTGCAGAA GAAATGGGTG TTCTTCAAGA 750 

GCGTATTACA TCTACTAAAT CTGGTTCGAT CACTTCG 787 



2 ) INFORMATION FOR SEQ ID NO: 244 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter Iwoffii 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 

GTTCCTAAGA TCTATGACGC TCTCCACGTT GATGGCACTG AAACTACATT 50 

AGAAGTTCAG CAACAACTTG GTGATGGCGT AGTTCGTACT ATTGCAATGG 100 

GTTCTACTGA AGGCCTTAAG CGTGGTTTGA ACGTAACTAA CACTAACGCG 150 

CCGATTTCTG TACCAGTAGG TACAGCGACT CTAGGTCGTA TCATGGACGT 2 00 

TCTTGGTCGC CCAATCGACG AAGCTGGTCC AGTTGCGACT GAAGCGCGTT 250 
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TGCCGATTCA CCGTCAAGCA CCTTCTTATG CTGAACAAGC AGCTTCTACT 300 

GACCTTTTAG AAACTGGTAT TAAAGTCATC GACTTACTTT GCCCGTTCGC 350 

TAAAGGTGGT AAAGTTGGTC TGTTCGGTGG TGCCGGTGTT GGTAAAACTG 4 00 

TAAACATGAT GGAGTTGATC AACAACATCG CTAAAGCGCA CTCAGGTTTA 450 

TCTGTATTCG CTGGTGTTGG TGAGCGTACT CGTGAAGGTA ATGACTTCTA 500 

TCACGAGATG AAAGACTCAA ACGTTCTAGA CAAAGTAGCA ATGGTCTACG 550 

GTCAGATGAA CGAGCCACCG GGTAACCGTT TACGCGTAGC GTTGACTGGT 600 

TTGAC CATGG CTGAGTACTT CCGTGACGAG AAAGACGAAA ACGGCAAAGG 650 

CCGTGACGTA CTATTGTTCG TAGATAACAT CTACCGTTAT ACACTAGCAG 70 0 

GTACTGAAGT ATCAGCACTT CTAGGTCGTA TGCCGTCTGC AGTAGGTTAC 750 

CAACCGACAC TTGCAGAAGA GATGGGTGTT CTTCAAGAAC GTATTACATC 800 

GACTAAGTCT GGTTCGATTA CGTCA 825 



2) INFORMATION FOR SEQ ID NO: 245 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 83 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG-197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 24 5 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 200 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 300 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400 

TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



2) INFORMATION FOR SEQ ID NO: 246 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 851 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis 

(B) STRAIN: ATCC 15554 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 

TTCCCCCGCG ACAGCATCCC TAAAGTCTAC GAAGCATTGA CGCTCGTTGA 50 

CGAAAGTTCG GCTTTCGCAG AAAAAGGCCT GACTTTTGAA GTACAGCAAC 10 0 

10 AATTGGGTGA CGGTGTAGTT CGCACCATCG CCATGGGTTC CAGCGACGGC 15 0 

CTGCGCCGCG GTATGGAAGT GGCCGGTTCG GGCGCTCCCA TCTCCGTTCC 2 00 

CGTGGGTGTC GGCACCCTGG GTCGCATTAT GGACGTTCTG GGTCGCCCTA 250 

TTGACGAAGT CGGTCCTATT CAGTCCGACG AGCGTCGCGC CATTCACCAG 3 00 

CCTGCGCCTA CTTTCGACGA ACTGTCGCCT TCCGTAGAGC TGCTGGAAAC 350 

15 CGGTATTAAA GTGATTGACC TGGTTTGCCC GTTCGCCAAG GGTGGTAAGG 400 

TTGGTCTGTT CGGTGGTGCC GGTGTGGGCA AGACCGTGAA CATGCTGGAG 4 50 

CTGATCAACA ACATCGCCAA GGCACACAGC GGTCTGTCCG TGTTTGCCGG 500 

TGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGGCCG 550 

ATGCTGGCGT TATCCAGATG GACAACCTGA GCGAGTCCAA AGTGGCCATG 600 

2 0 GTGTTCGGTC AGATGAACGA ACCTCCAGGC AACCGTCTGC GTGTGGCACT 650 

GTCCGGCCTG ACCATGGCCG AGAAGTTCCG TGACGAAGGC CGTGACATCC 7 00 

TGTTCTTTGT GGACAACATC TACCGCTACA CGCTGGCCGG TACAGAAGTG 750 

TCCGCTCTGC TGGGTCGTAT GCCTTCCGCA GTGGGTTACC AGCCTACGCT 800 

GGCCGAGGAA ATGGGTAAGC TGCAAGAGCG CATTACCTCC ACCAAGACCG 850 

25 G 851 



30 



2) INFORMATION FOR SEQ ID NO: 247 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
B5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
i0 (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 

15 GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 2 50 

* 50 ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 3 00 

' CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 450 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

55 TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

50 ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 
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ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTCT 846 



5 2) INFORMATION FOR SEQ ID NO: 248 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 



CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 2 00 

25 ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 2 50 

CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 3 00 

GAATTATCTA CTAAAGTAGA AATT CTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 400 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AGTTAATCAA TAACATCGCA 450 

3 0 CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

35 CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 800 

ATCTACAAAT 810 



£0 



50 



2) INFORMATION FOR SEQ ID NO: 249 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 944 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 

CTTTGATTAT GGGACAGAAG AAACAGTAAC CCTCCCCCGT ATCCACGACG 50 

CCATGGAGAT TTCCCGCCCA AACGGAAAGA TCTTGATCGT CGAAGTTCAG 100 

CAACACATCG GGGAAAACAC CGTCCGTACC GTAGCGATGG ATACGACCGA 150 

50 CGGATTGAGA CGAGGCATGG AGGCCGTGTC ATACGGAATG CCCATCACCA 200 
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TGCCGACCGG CGACCAAGTC AAAGGACGTT TAATGAATGT CACCGGCGAC 250 

CCTATCGATG GCATGGCCCA GCTTACTAAA GACGGGGCTC TTCCCATCCA 3 00 

TCGTGAGCCT CCTAAATTCG AGGATCTGAC AACGACCCAA GAGGTCTTGT 3 50 

ACACGGGTAT TAAAGTAATC GATTTATTGG AACCTTACGC CAAAGGAGGC 400 

5 AAGATCGGAC TTTTCGGAGG AGCCGGAGTC GGCAAAACGG TATTGATCAT 450 

GGAATTGATC AACAACATCG CAAAGAAAAA CAACGGATTC TCCGTCTTCG 500 

CCGGTGTGGG TGAACGTACA CGTGAAGGAA ATGACCTATT GCGTGAAATG 550 

ATCCAATCCG GTGTCATCCG ATACGGCGAG GAGTTTAAGA AGAGTATGGA 600 

GGCAGGCAAC TGGGACTTGT CGAAGATCGA TTACGATGAA TTGGCTAAGT 650 

10 CGCAGGCTAC CTTGGTATTC GGGCAGATGA ACGAACCGCC GGGCGCCCGC 700 

TCGTCCGTGG CCTTATCCGG TTTGACGATC GCCGAATCCT TCCGTGACAA 750 

AGCCTCTGAG GGAGAAAGAA AAGATATATT ATTCTTCATC GATAATATCT 800 

TCCGTTTCAC CCAAGCCGGT TCCGAGGTTT CCGCCTTGCT GGGGCGTATG 850 

CCTTCCGCCG TAGGTTACCA ACCGACATTG GCCACGGAAA TGGGAGCTAT ' 900 

15 GCAAGAGCGT ATCACTTCAA CCAAGAAGGG CTCCATCACC TCCG 944 



20 



2) INFORMATION FOR SEQ ID NO: 250 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 93 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Bacteroides ova t us 

(B) STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 



35 TTGAGGGTAC GGATGCAGAA TTGGTGCTGC CAAGCATCCA CGACGCACTG 50 

GAGATAAAGA GGC CAAACGG CAAAATACTG GTTGTAGAAG TTCAGCAACA 10 0 

TATCGGCGAA AATACGGTGC GTACCGTAGC GATGGACAGT ACTGACGGAC 150 

TTCAGAGAGG CATGAAAGTG TATCCCACCG GAGGCCCGAT CACGATGCCG 20 0 

ATTGGCGAAC AGATTAAAGG ACGACTGATG AACGTAGTCG GTGATTCGAT 250 

40 CGACGGTATG AAAGGACTCG ACCGCAAAGG TGCATATTCC ATTCATCGCG 3 00 

ACCCCCCTAA GTTTGAGGAT TTGACTACTG TGCAAGAGGT GCTCTTCACA 350 

GGTATCAAAG TGATCGACCT GCTCGAACCG TATGCCAAAG GTGGTAAAAT 400 

CGGTTTGTTC GGCGGTGCCG GTGTAGGAAA GACTGTATTG ATTCAGGAAC 450 

TTATCAATAA TATCGCCAAG AAACATAATG GATTCTCTGT ATTTGCCGGA 500 

45 GTAGGTGAAC GTACCCGTGA AGGTAACGAC TTGCTGCGCG AAATGATTGA 550 

ATCCGGTGTA ATCCGTTACG GCGAAGCATT CAAAGAAGGA ATGGAGAAAG 600 

GTCACTGGGA TCTTTCGAAA GTGGATTATA ACGAACTGGA GAAATCGCAA 650 

GTGTCTCTGA TTTTCGGTCA GATGAACGAG CCTCCGGGCG CACGTGCCTC 700 

TGTGGCATTG TCCGGACTGA CGGTGGCGGA ATCTTTCCGC GACGCAGGAA 750 

50 AAGAAGGTGA GAAACGCGAT ATTCTGTTCT TTATTGATAA TATCTTCCGT 800 

TTCACGCAAG CAGGTTCAGA AGTGTCCGCC CTTTTGGGAC GTATGCCCTC 850 

CGCTGTTGGT TACCAGCCCA CGTTGGCTAC GGAAATGGGT GCGATGCAGG 90 0 

AACGTATCAC GTCTACCCGC AAAGGTTCTA TCACCTCCG 93 9 

55 



2 ) INFORMATION FOR SEQ ID NO: 251 



(i) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 833 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251 



15 



20 



25 



30 



CGAATTCCCT 
AGAATGGTAA 
GGTATTGTGC 
TCTGGAAGTT 
CAACCCTGGG 
GGCGACATCG 
CTATGAAGAG 
TTATCGACCT 
GGCGGTGCGG 
CATCGCGATC 
GTACTCGTGA 
CTGGACAAAG 
CCGTCTGCGC 
ACGAAGGTCG 
CTGGCCGGTA 
AGGCTACCAG 
TCACCTCTAC 



CAGGATGCCG 
TGAGAGCCTG 
GTACCATCGC 
AAAGACCTCG 
CCGTATCATG 
GCGAAGAAGA 
CTGTCCAGCT 
GATGTGTCCG 
GTGTAGGTAA 
GAGCACTCCG 
GGGTAACGAC 
TATCCCTGGT 
GTTGCGCTGA 
TGACGTACTG 
CGGAAGTATC 
CCGACCCTGG 
CAAAACCGGT 



TACCGCGCGT 
GTGCTGGAAG 
CATGGGTTCT 
AGCACCCAAT 
AACGTCCTGG 
GCGTTGGGCT 
CTCAGGAACT 
TTCGCGAAGG 
AACCGTAAAC 
GTTACTCCGT 
TTCTACCACG 
TTACGGCCAG 
CCGGCCTGAC 
CTGTTCGTTG 
CGCACTGCTG 
CGGAAGAGAT 
TCTATCACCT 



GTACGATGCT 
TTCAGCAGCA 
TCCGACGGTC 
CGAAGTACCA 
GTCAGCCGAT 
ATCCACCGTT 
GCTGGAAACC 
GCGGTAAAGT 
ATGATGGAGC 
GTTTGCAGGC 
AAATGACCGA 
ATGAACGAGC 
CATGGCTGAG 
ACAACATCTA 
GGTCGTATGC 
GGGTGTTCTG 
CCG 



CTTGAGGTTC 50 

GCTCGGCGGC 100 

TGCGTCGTGG 150 

GTAGGTAAAG 200 

CGACATGAAA 250 

CAGCACCTTC 300 

GGCATCAAAG 350 

TGGTCTGTTC 4 00 

TGATCCGTAA 450 

GTGGGTGAGC 500 

CTCCAACGTT 550 

CACCAGGAAA 600 

AAGTTCCGTG 650 

CCGTTACACC 700 

CATCAGCAGT 750 

CAGGAACGTA 800 

833 



2) INFORMATION FOR SEQ ID NO: 252 

35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas mal tophi lia 
45 (B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 

GTGTACGACG CACTGAAGGT GGAAAACACC GAGATCACCC TCGAAGTCCA 50 

50 GCAGCAGCTG GGCGACGGCG TGGTGCGTAC CATCGCCCTC GGTTCCACCG 100 

ACGGCCTGAA GCGCAACCTG GTTGCCGTCA ACACCGGCCG TGGCATCTCG 150 

GTGCCGGTCG GCGCCGGCAC CCTGGGCCGC ATCATGGACG TGCTGGGCCG 200 

TCCGATCGAC GAAGCCGGCC CGGTCGCTGC CAGCGACAAC TGGGAAATCC 250 

ACCGCGCTGC GCCGTCGTAT GAAGAC CAGT CCCCGGCCAC CGAGCTGCTG 300 

55 GAAACCGGCA TCAAGGTCAT CGACCTGATG TGCCCGTTCG CCAAGGGCGG 350 

CAAGGTCGGC CTGTTCGGCG GCGCCGGCGT CGGCAAGACC GTCAACATGA 400 

TGGAACTGAT CAACAACATC GCCAAGGCGC ACAGCGGCCT GTCCGTGTTC 450 

GCCGGCGTGG GTGAGCGTAC CCGTGAGGGC AACGACTTCT ACCACGAGAT 500 

GAAGGACTCC AACGTCCTGG ACAAGGTGGC GATGGTGTAC GGCCAGATGA 550 

60 ACGAGCCGCC GGGCAACCGT CTGCGCGTCG CCCTGACCGG CCTGACCATG 600 
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5 TGGTTCGATC ACCTCGATC 



PCT/CA00/01150 

AACGGCAAGG GCAAGGACGT 650 
CACCCTGGCC GGTACCGAAG 7 00 

CGGTGGGTTA CCAGCCGACC 750 
CGCATCACCT CGACCAAGAA 800 

819 



10 



2) INFORMATION FOR SEQ ID NO: 253 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 49882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 

2 5 AGTTTGAAGG CCCTTTGCCA AATATTCTCA ATGCATTAGA AACAGATAAT 50 

TTGGGCAATC GGCTAGTTTT AGAAGTTGCT CAGCATTTGG GTGAAAATAC 100 

CGTGCGTACC ATTGCCATGG ATACTACCGA TGGTCTTGTC CGTGGTCAAA 150 

AAGTTTTTGA TACAGGAACA CAGATCAGTG TTCCCGTGGG AGAAGCAACA 2 00 

CTTGGTCGTA TTATGAATGT GATTGGAGAG CCGGTTGATA ATGTTGGCCC 250 

30 AATTGCTACA AGCAAAACCC GTTCCATTCA CCAAGAGGCT CCTGAATATG 300 

TGGAGCAATC AACCGCATCA GAAATCCTTG TGACTGGTAT TAAAGTCGTT 350 

GATCTGTTAG CTCCTTATTC TAAAGGGGGG AAGGTTGGTT TGTTTGGAGG 400 

TGCCGGTGTT GGTAAAACCG TTCTCATTAT GGAGCTTATC AACAATATTG 450 

CAAAGGCGCA TGGTGGCTAT TCAGTGTTTG CCGGTGTTGG TGAACGTACA 500 

35 CGTGAGGGAA ATGATCTTTA TTATGAAATG ATCGAAAGCC GTGTGAATGT 550 

GAATCCAAAA GACAACAATG GTTCAACAGA AGGATCAAAA TGTGCACTCG 600 

TTTATGGGCA AATGAATGAA CCACCAGGGG CGCGTGCACG TGTGGCTCTT 650 

TCAGGATTGA CCATTGCAGA AAGTTTCCGT GATGAGGGAC AAGATGTTTT 700 

GTTCTTCGTA GATAATATTT TCCGTTTTAC GCAAGCAGGC GCTGAAGTGT 750 

10 CAGCTCTTTT AGGGCGTATT CCTTCTGCTG TAGGGTATCA GCCAACTTTG 800 

GCAACTGATA TGGGGGCTTT GCAAGAGCGT ATTACCAGTA CAAGAACAGG 850 

CTCTATTACC TCTG 864 



45 



55 



2) INFORMATION FOR SEQ ID NO: 254 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 866 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentis 

(B) STRAIN: ATCC 15703 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 

152 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TTCCCGGTGG GCCACCTGCC CGACATTTAC AATGCACTTA CCGTTGAACT 5 0 

GKCCAACACC GGYGTCCACG AASAGGGCGA GACCACCAAG AAGATCACCC 100 

TTGMGGTTGA ACAGCATCTT GGCGATTCCA CCGTGCGTAC CGTCGCTCTG 150 

5 AAGCCGACTG ACGGCCTTGT GCGTGGCGCC ACCGTGTATG ACACCGGCGG 200 

CCCGATCTCT GTGCCGGTTG GCGATGTCAC CAAGGGCCAC GTATTCGACG 250 

TGTCCGGCAA CATCCTCAAC AAGAAGGCCG ACGAGACCGT TAAGGTTACC 30 0 

GAACGTTGGT CTATCCACCG TAACCCGCCG GCATTCGACC AGCTGGAGTC 35 0 

CAAG AC CC AG ATGTTCGAAA CCGGTATCAA GGTCATCGAT TTGCTGACCC 4 00 

10 CGTATGTGCA GGGCGGCAAG ATCGGTCTGT TCGGCGGCGC AGGCGTCGGC 450 

AAGACCGTGC TGATCCAGGA AATGATTCAG CGTGTGGCTC AGAACCACGG 500 

CGGTGTGTCC GTGTTCGCAG GCGTCGGCGA GCGTACCCGT GAGGGTAACG 550 

ATCTGATCGG CGAAATGGAC GAAGCCGGCG TGCTCGAGAA GACCGCACTG 60 0 

GTCTTCGGCC AGATGGATGA GCAGCCGGGT ACCCGTCTGC GCGTGCCGCT 650 

15 GACCGCACTG ACCATGGCAG AGTACTTCCG TGACGTACAG AATCAGGACG 700 

TGCTGCTGTT CATCGATAAC ATCTTCCGTT TCACCCAGGC TGGTTCCGAG 75 0 

GTGTCCACCC TGCTCGGCCG TATGCCGTCC GCAGTGGGCT ACCAGCCGAA 800 

CCTGGCCGAT GAGATGGGCG CGCTGCAGGA GCGAATCACT TCGACCCGTG 850 

GACACTCCAT CACCTC 866 

20 

2) INFORMATION FOR SEQ ID NO: 2 55 

25 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 
35 (B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 

AAGGC CAGCT GCCGCTGATC CTGAACGCGC TTGAAGTGGA CAATCAGGGC 5 0 

4 0 CATCGTCTGG TTCTCGAAGT TGCCCAGCAC CTCGGCGAAG ACACCGTGCG 100 

CACCATCGCC ATGGACGCGA CCGAAGGTCT CGTTCGCGGT CAGGAAGCAC 150 

GCGACACTGG CGAACCGATC ATGGTGCCGG TCGGCGTCGA AACGCTTGGC 2 00 

CGCAT CATGA ACGTCATCGG CGAGCCGGTT GACGAAGCAG GCCCCATCAA 2 50 

GACCAAGGCA ACCCGCGCCA TCCACCAGAA CGCGCCGGAA TATATCGAAC 300 

45 AGTCGACCGA AGCCGAAATT CTGGTCACGG GCATCAAGGT CGTCGACCTT 350 

CTGGCGCCTT ACGCCAAGGG CGGCAAGATC GGCCTCTTCG GCGGTGCAGG 400 

CGTCGGCAAG ACCGTTCTCA TCATGGAACT CATCAACAAC GTCGCCAAGG 450 

CGCACGGCGG TTATTCCGTG TTCGCAGGCG TCGGTGAGCG TACCCGTGAG 500 

GGCAACGACC TTTACCACGA AATGATCGAG TCGGGCGTGA ACAAGCTCGG 5 50 

50 CGGCGGCGAA GGCTCCAAGG CAGCCCTCGT TTACGGCCAG ATGAACGAAC 600 

CCCCGGGTGC CCGCGCCCGC GTTGCCCTTT CCGGTCTGAC GGTTGCTGAA 650 

AACTTCCGTG ACCAGGGCCA GGACGTTCTG TTCTTCGTGG ACAACATCTT 700 

CCGCTTCACG CAGGCAGGTT CGGAAGTGTC GGCTCTTCTC GGCCGTATTC 750 

CTTCCGCTGT GGGTTATCAG CCGACGCTGG CAACCGACAT GGGCGCCATG 800 

55 CAGGAACGCA TCACCACGAC GACCAAGGGT TCGATCACCT CG 842 



60 



2) INFORMATION FOR SEQ ID NO: 256 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Cedecea davisae 

(B) STRAIN: ATCC 33431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 

15 CGAGTTCCCT CAGGACGGCG TACCGCGCGT TKATGATGCG CTTGAAGTAC 50 

AAAATAACAG CGAGCAGCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

GGTATCGTGC GTACCATCGC TATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGAAGTT AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 2 00 

CGACCCTTGG CCGTATCATG AACGTGCTGG GTCAGCCGAT CGACATGAAA 250 

2 0 GGCGATATCG GCGAAGAAGA CCGTTGGGCT ATTCACCGCG CTGCACCTTC 30 0 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TAATCGACCT TATCTGTCCG TTCGCTAAGG GCGGTAAAGT AGGTCTGTTC 400 

GGTGGTGCGG GCGTGGGTAA AACCGTAAAC ATGATGGAGC TTATCCGTAA 4 50 

CATCGCGATC GAGCACTCCG GCTACTCCGT GTTTGCGGGC GTGGGTGAGC 5 00 

25 GTACTCGTGA GGGTAACGAC TTCTATCACG AAATGACCGA CTCCAACGTT 550 

CTGGACAAAG TTGCCCTGGT TTACGGCCAG ATGAACGAGC CACCGGGTAA 600 

CCGTCTGCGC GTAGCGCTGA C CGGTC TGAC CATCGCGGAG AAATTCCGTG 650 

ACGAAGGTCG TGACGTTCTG CTGTTCGTTG ATAACATCTA CCGTTACACC 7 00 

CTGGCCGGTA C TGAAGTATC CGCGCTGCTG GGTCGTATGC CTTCTGCGGT 750 

30 AGGTTACCAG CCAACTCTGG CGGAAGAGAT GGGTGTTCTT CAGGAGCGTA 800 

TTACCTCCAC CAAGACCGGT TCCATCACCT CCG 833 



35 2 ) INFORMATION FOR SEQ ID NO: 257 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 
£0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257 

50 

TTCCCTCAGG ACGGCGTACC GCGCGTATAT GACGCGCTTG AGGTACAGAA 50 

TAACAGCGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCAATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GAAGTGAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 2 00 

55 TCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCAATTGAT ATGAAAGGCG 250 

ACATCGGCGA AGAAGATCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAT 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

SO GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCAGGCGTGG GTGAGCGTAC 500 

154 
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TCGTGAGGGT AACGACTTCT ACCACGAGAT 
ACAAAGTTGC ACTGGTTTAC GGCCAGATGA 
CTGCGCGTAG CGCTGACCGG TCTGACCATC 
AGGCCGTGAC GTTCTGCTGT TCGTCGATAA 
5- CCGGTACAGA AGTTTCTGCA CTGCTGGGTC 
TATCAGCCTA CTCTGGCAGA AGAGATGGGT 
CTCCACCAAG ACCGGTTCCA TCACTTCCG 



PCT/CA00/01150 

GACCGACTCC AACGTTCTGG 550 

ACGAGCCGCC AGGTAACCGT 600 

GCGGAGAAAT TCCGTGACGA 650 

CATCTATCGT TATACCCTGG 700 

GTATGCCATC TGCGGTAGGT 750 

GTTCTTCAGG AGCGTATTAC 8 00 

829 



10 



20 



45 



2) INFORMATION FOR SEQ ID NO : 258 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 0 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea. neteri 

(B) STRAIN: ATCC 33855 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258 

TCCCTCAGGA CGGCGTACCG CGCGTTTATG ACGCGCTTGA GGTACAGAAC 50 

AACAATGAGA AGCTGGTGCT GGAAGTTCAG CAGCAGCTCG GCGGCGGTAT 100 

CGTGCGTACC ATCGCAATGG GTTCTTCCGA CGGTCTGCGT CGTGGTCTGG 150 

30 TAGTAACAGA CCTCGAGCAC CCGATCGAAG TCCCGGTAGG TAAAGCGACC 2 00 

CTTGGCCGTA TCATGAACGT GCTGGGTCAG CCGATCGACA TGAAAGGCGA 2 50 

CATCGGCGAA GAAGACCGTT GGGCGATTCA CCGCGCAGCA CCTTCCTACG 3 00 

AAGAGCTGTC CAGCTCTCAG GAATTGCTGG AAACCGGCAT CAAAGTTATC 350 

GACCTGATTT GTC CGTTCGC TAAGGGCGGT AAAGTAGGTC TGTTCGGTGG 4 00 

35 TGCGGGCGTA GGTAAAACCG TAAACATGAT GGAGCTGATC CGTAACATTG 4 50 

CGATCGAGCA CTCCGGTTAT TCCGTGTTTG CGGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACCGACTCCA ACGTTCTGGA 550 

TAAAGTAGCA CTGGTTTACG GCCAGATGAA CGAGCCACCA GGTAACCGTC 600 

TGCGCGTAGC GCTGACCGGT CTGACCATCG CGGAAAAATT CCGTGACGAA 650 

40 GGCCGCGACG TTTTGCTGTT CGTTGATAAC ATCTACCGTT ATACCCTGGC 700 

CGGTACCGAA GTATCTGCAC TGCTGGGTCG TATGCCTTCT GCGGTAGGTT 750 

ATCAGCCAAC TCTGGCAGAA GAGATGGGTG TTCTTCAGGA GCGTATTACC 800 

TCCACCAAGA CCGGTTCTAT CACCTCCGTA 83 0 



2) INFORMATION FOR SEQ ID NO : 259 



( i > SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chryseobacterium meningosepticum 

(B) STRAIN: CDC B7681 

50 

155 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259 

TTTCAGGAAG TAGAGGAACT ACCAAACATT TATGACGCAC TTGAAGTTGT 5 0 

CAGAGAAGGC CAGAAAGGTC TGATCTTAGA AGTTGAACAA CACATCGGTG 100 

5 AGGATACAGT GAGATGTATC GCTATGGATG CAACAGACGG ACTTCAAAGA 150 

GGACAATCTG TAATAGGACA TGGAAGACAA ATTACTATGC CTATTGGTGA 2 00 

AGAAGTAAAT GGTAGATTGT TCAACGTGGT AGGAGATGCT ATCGACGGAC 2 50 

TTCAGGAATT ATCTAAAGAC AATGGTTTGC CAATCCACAG AGAAGCACCT 3 00 

AAATTCGATC AGTTATCAAC TTCTGCTGAA GTACTATATA CAGGTATCAA 3 50 

10 AGTAATCGAC CTTATCGAGC CTTATGCAAA AGGTGGTAAA ATTGGTTTGT 4 00 

TCGGTGGTGC TGGTGTAGGT AAAACAGTAT TGATCCAGGA ATTGATTAAT 450 

AACATTGCTA AAGGACACGG TGGTCTTTCT GTATTCGCAG GAGTAGGTGA 500 

GAGAACAAGA GAAGGAAATG ACCTTCTTCG TGAGATGTTA GAGTCTGGTA 550 

TTATTAAATA TGGTGACGAA TTCATGCATT CTATGGAGAA CGGTGGATGG 600 

15 GATCTTTCTA AAGTTGACAG TGAGTTGATG AAAGAGTCTA AAGCTGCTTT 650 

CGTTTT CGGA CAGATGAACG AGCCACCAGG TGCAAGAGCA CGTGTAGCCC 7 00 

TTTCTGGTCT TACTTTAGCT GAATACTACC GTGATGGTGG CGAAAGCGGA 750 

CAAGGTAGAG ACGTTCTTTT CTTCGTAGAC AACATCTTCC GTTTTACACA 8 00 

GGCTGGTTCT GAGGTGTCTG CACTTCTAGG TCGTATGCCT TCAGCGGTAG 850 

2 0 GTTACCAACC AACTCTAGCT TCTGAGATGG GTGCAATGCA GGAGAGAATT 900 

ACTTCAACTA AAAACGGATC T ATT AC AT CT G 931 



25 2) INFORMATION FOR SEQ ID NO: 260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 6 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE; 

(A) ORGANISM: Ci trobacter amalonaticus 

(B) STRAIN: ATCC 25405 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260 



GTATCGTACG TACCATCGCA ATGGGTTCTT CCGACGGTCT GCGTCGTGGT 50 

CTGGAAGTCA AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC 100 

AACCCTGGGT CGTATCATGA ACGTCCTGGG TCACCCGATC GACATGAAAG 150 

GCGATATCGG TGAAGAAGAG CGTTGGGCTA TCCACCGCGC AGCACCGTCC 200 

45 TATGAAGAGC TGTCCAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT 250 

TATCGACCTG ATGTGTCCGT TCGCGAASGG CGGTAAAGTG GGTCTGTTCG 300 

GTGGTGCGGG TGTAGGTAAA ACCGTAAACA TGATGGAGCT TATCCGTAAC 350 

ATCGCGATCG AGCACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG 4 00 

TACTCGTGAG GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC 450 

50 TGGATAAAGT ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCGGGAAAC 500 

CGTCTGCGCG TTGCACTGAC CGGTCTGACC ATGGCAGAGA AATTCCGTGA 550 

CGAAGGTCGT GACGTACTGC TGTTCGTCGA TAACATCTAT CGTTACACCC 600 

TGGCCGGTAC GGAAGTTTCC GCACTGCTGG GTCGTATGCC ATCAGCGGTA 650 

GGTTACCAGC CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT 70 0 

55 CACTTCTACC AAAACCGGTT CTATCA 726 



2) INFORMATION FOR SEQ ID NO: 2 61 

50 
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( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Ci troJbacter braakii 

(B) STRAIN: ATCC 43162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 

15 GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACTC TGGGTCGTAT 200 

CATGAACGTC CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 2 50 

20 AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTATGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

25 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600 

CTGACCGGTC TGACCATGGC AGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACAGAAG 700 

TATCCGCTCT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCGACC 750 

30 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC AC 812 



35 2) INFORMATION FOR SEQ ID NO: 262 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci troJbacter koserl 

(B) STRAIN: ATCC 27156 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 



GCGCGTGTAC GACGCCCTTG AGGTGCAGAA TGGTAATGAA CATCTGGTGC 50 

TGGAAGTTCA GCAGCAGCTC GGTGGCGGTA TCGTACGTAC CATCGCCATG 100 

GGTTCTTCCG ACGGCCTGCG TCGTGGTCTG GATGTGAAAG ACCTTGAGCA 150 

CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT ATCATGAACG 200 

55 TACTGGGCGA ACCAGTAGAC ATGAAAGGCG AGATCGGTGA AGAAGAGCGT 250 

TGGGCTATCC ACCGTGCGGC ACCGTCCTAC GAAGAGTTGT CAAACTCTCA 3 00 

GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG TGCCCGTTCG 350 

CGAAGGGCGG TAAAGTGGGT CTGTTCGGTG GTGCGGGTGT AGGTAAAACC 400 

GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAAC ACTCCGGTTA 4 50 

SO CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT AACGACTTCT 500 

157 
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AC C ACGAAAT GACCGACTCC AACGTTATCG 
GGCCAGATGA ACGAGCCGCC GGGAAACCGT 
CCTGACCATG GCGGAGAAAT TCCGTGACGA 
TCGTCGACAA CATCTACCGT TACACCCTGG 
5 CTGCTGGGTC GTATGCCTTC AGCGGTAGGT 
AGAGATGGGT GTTTTGCAGG AACGTATCAC 
TCACCTCCGT A 
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ACAAAGTATC CCTGGTTTAC 550 

CTGCGCGTTG CGCTGACCGG 600 

AGGTCGTGAC GTTCTGCTGT 650 

CCGGTACGGA AGTATCCGCA 700 

TACCAGCCGA CCCTGGCGGA 750 

CTCCACCAAA ACCGGTTCTA 8 00 

811 



10 



20 



45 



60 



2) INFORMATION FOR SEQ ID NO: 263 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 816 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter farmer i 

(B) STRAIN: ATCC 51112 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTTATGA ATGGTAAAGA 50 

GAGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT ATCGTACGTA 10 0 

CCATCGCGAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

3 0 GACCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CTCTGGGTCG 2 00 

TATCATGAAC GTCCTGGGTC ACCCGATCGA CATGAAAGGC GATATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGCGCAG CGCCATCCTA TGAAGAGCTG 3 00 

TCCAGCTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTT GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

35 TAGGTAAAAC CGTAAACATG ATGGAACTCA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGATTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTATA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

40 CGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGC CGT ATGC CAT CAGCGGTAGG CTACCAGCCA 750 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CTTCTACCAA 800 

AACCGGTTCT ATTACC 816 



2) INFORMATION FOR SEQ ID NO: 264 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 

AGGATGCCGT ACCGCGCGTG TACGATGCTC TTGAGGTTAT GAATGGTAAA 50 

GAGAGCCTGG TGCTGGAAGT TCAGCAGCAG CTCGGCGGCG GTATCGTACG 100 

5 TACCATCGCC ATGGGTTCTT CTGACGGTCT GCGTCGTGGT CTGGAAGTTA 150 

AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC AACGCTGGGT 200 

CGTATCATGA ACGTTCTGGG TCACCCGATC GACATGAAAG GCGATATCGG 250 

TGAAGAAGAG CGTTGGGCTA TCCACCGTGC AGCACCTTCC TACGAAGAGC 300 

TGTCAAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT TATCGACCTG 350 

10 ATGTGTCCGT TCGCTAAGGG CGGTAAAGTT GGTCTGTTCG GTGGTGCGGG 4 00 

TGTAGGTAAA ACCGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG 450 

AACACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC TGGACAAAGT 550 

ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCTGGAAAC CGTCTGCGTG 600 

15 TTGCGCTGAC CGGTCTGACC ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT 650 

GACGTTCTGC TGTTCGTTGA TAACATCTAT CGTTACACCC TGGCCGGTAC 70 0 

AGAAGTATCT GCACTGCTGG GTCGTATGCC ATCAGCGGTA GGCTACCAGC 750 

CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT CACCTCCACC 8 00 

AAAACCGGTT CTATCACCT 819 

20 



2) INFORMATION FOR SEQ ID NO: 265 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci tjrobactez" koseri 
35 (B) STRAIN: ATCC 27028 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 



GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTGCAGA ATGGTAATGA 50 

40 ACATCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATCGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGCCTGC GTCGTGGTCT GGATGTGAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 2 00 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GAGATCGGTG 2 50 

AAGAAGAGCG TTGGGCTATC CACCGTGCGG CACCGTCCTA CGAAGAGTTG 3 00 

45 TCAAACTCTC AGGAACTGTT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAA 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

50 CCTTGGTTTA CGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTACCAGCCG 750 

ACCCTGGCGG AAGAGATGGG TGTTTTGCAG GAACGTATCA CCTCCACCAA 800 

55 AACCGGTTCT ATCACCTCCG TA 822 



60 



2) INFORMATION FOR SEQ ID NO: 266 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Citrobacter sedlakii 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 66 

15 GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTACAGA ATGGTAATGA 50 

GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATTGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTTGAGC ACC CGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GACATCGGTG 250 

20 AAGAAGAGCG TTGGGCTATC CACCGTGCCG CGCCGTCCTA TGAAGAGTTG 300 

TCTAACTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGAC CTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

25 TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

CCCTGGTGTA CGGCCAGATG AACGAGCCGC CTGGAAACCG TCTGCGCGTC 600 

GCACTGACCG GTCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 750 

3 0 ACTCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCAACCAA 800 

AACCGGTTCT ATCACCTCCG 820 



35 2) INFORMATION FOR SEQ ID NO: 267 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter werkmanii 

(B) STRAIN: ATCC 51114 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267 



GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACCC TGGGTCGTAT 200 

55 CATGAACGTC CTGGGTCATC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTACCTATGA AGAACTGTCC 3 00 

AGTTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 

50 TCCGGTTACT CAGTGTTTGC GGGCGTTGGT GAACGTACTC GTGAGGGTAA 500 

160 
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CGACTTCTAC CACGAAATGA CCGACTCCAA 
TGGTATATGG CCAGATGAAC GAGCCGCCGG 
CTGACCGGTC TGACCATGGC TGAGAAGTTC 
TCTGCTGTTC GTTGATAACA TCTATCGTTA 
5 TATCTGCACT GCTGGGTCGT ATGCCATCAG 
CTGGCGGAAG AGATGGGTGT TCTGCAGGAA 
CGGTTC 



PCT/CA00/01150 

CGTTCTGGAC AAAGTATCCC 550 

GAAACCGTCT GCGCGTTGCG 600 

CGTGACGAAG GTCGTGACGT 650 

CACCCTGGCC GGTACTGAAG 7 00 

CGGTAGGCTA CCAGCCAACC 750 

CGTATCACCT CTACCAAAAC 8 00 

806 



10 



2 ) INFORMATION FOR SEQ ID NO: 268 



(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 810 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Ci trobacter youngae 

(B) STRAIN: ATCC 29935 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

3 0 CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 2 00 

CATGAACGTT CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGAGAAG 250 

AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTATGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

35 GTAAAACCGT AAACATGATG GAGCTTATTC GTAAC ATCG C GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGTGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGATTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTTTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

10 ACTGCTGTTC GTCGATAACA TCTATCGTTA TACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT TCTGCAGGAA CGTATCACTT CTACCAAAAC 800 

CGGTTCTATC 810 

15 

2) INFORMATION FOR SEQ ID NO : 2 69 

( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 

50 

161 
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WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269 

TTGAGAACGG AGATTTGCCG CAGCTATTGA CCGCTATTGA AATTCCTCTG 50 

AAAGACAGTG AATCTCTGAT TGTCGAAGTT GCTCAGCATA TCGGTGATGA 100 

5 ACGTGTCCGC TGTATCGCTA TGGGCGGTAC AGATGGTCTG GTTCGTGGAA 150 

TGGAAGCCAT TGATACAGGA TCCGCAATCC GTGTACCGGT GGGAAAAGAA 2 00 

ATTCTGGGAA GAATGTTCAA TGTCCTCGGA CGTGAAATTG ATGGTCTGGG 250 

ACCTGTAGGA ACGGATAACA CACTGCCGAT CCACAGACAG GCACCGGGCT 3 00 

TTGAGGAGCA GCAGACATCC GCAGAAATGC TGGAAACAGG AATTAAGGTC 350 

10 ATTGACCTGT TATGTCCATA TTCCAAGGGT GGTAAGATTG GTTTGTTTGG 4 00 

TGGTGCGGGA GTAGGTAAAA CCGTACTGAT TCAGGAGCTG ATTCATAATA 450 

TCGCCAAGGA ACATGGTGGA ATGTCCGTCG TTACCGGTGT AGGGGAGAGA 500 

ACCCGTGAAG GAAACGACAT GTATCATGAA ATGAAGGACA GCGGTGTCCT 550 

TGATAAGACC GTACTGGTTT ACGGACAGAT GAATGAATCA CCGGGTGCCA 600 

15 GAATGCGTGT CGGTCTGACC GGGCTGACGA TGGCGGAATA TTTCCGTGAT 650 

CACGACCATC AGGATGTATT GCTGTTTATT GATAATATTT TCCGTTTTAC 700 

CCAGGCGGGA AGTGAAGTAA GTGCCCTGCT GGGACGTATG CCAAGTGCAG 75 0 

TAGGCTATCA GCCGACACTT GCGACAGAAA TGGGACAGCT GCAGGAGCGC 800 

ATTACATCCA CGAAGGATGG TTCCATT 827 

20 

2) INFORMATION FOR SEQ ID NO: 270 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium p erfri ngen s 
35 (B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 

GCAAATGATG AACTTCCTAA TATATTTAAT GCAATACACA TAAAAATGGA 50 

40 CGATGGAAAA ATCTTAGTTT GTGAGGTAGA GCAACACGTA GGAGACGATA 100 

TAGTTAGAAC TATAGCTATG GAAGCTACTG AAGGACTAAG AAGAGGTGTA 150 

GAAGCTGTTG ATACAGGAGC ACCTATATCA GTACCAGTTG GTGAATGCGT 200 

ATTAGGAAGA ATATTTAACG TATTAGGTAA ACCACTAGAT AGTGGAGCTG 250 

AAGTTAATAA CGAAGAAAAA TATCCAATTC ATAGACCAGC TCCATCATTT 3 00 

45 GAAGAACAAT CAGTTGTTCC TCAAATGTTT GAGACAGGAA TAAAGGTTAT 3 50 

CGACCTTTTA GCACCTTACC AAAGAGGGGG AAAAATCGGT CTATTTGGAG 400 

GTGCAGGTGT TGGTAAAACA GTTCTTATCC AAGAGCTTAT AAACAACATA 45 0 

GCTAAAGAGC ACGGTGGACT TTCTGTATTC ACAGGAGTTG GAGAAAGATC 500 

AAGAGAAGGT AATGACCTTT ACTATGAAAT GATGGAATCA GGAGTTATAA 55 0 

50 AAAATACAGC ATTAGTATTT GGACAAATGA ACGAACCACC TGGAGCAAGA 600 

ATGAGAGTTG CTTTAACAGG ACTTACTATG GCTGAGTACT TCAGAGACCA 650 

AGGTCAAGAC GTGTTATTAT TCATAGATAA CATATTCAGA TTCTCACAAG 700 

CTGGATCAGA GGTTTCAGCT TTATTAGGAA GAATACCATC AGCTGTTGGT 750 

TACCAACCAA CTCTTGCTAC AGAGATGGGA GCTCTTCAAG AGAGAATCAC 800 

55 ATCAACTACC CATGGATCAA TTACATCAG 829 
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2) INFORMATION FOR SEQ ID NO: 271 

162 
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WO 01/23604 PCT/CA00/01150 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271 
TTGTCCACGT TGGATRTCTT CA 22 



15 2) INFORMATION FOR SEQ ID NO: 272 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium. diphtheriae 

(B) STRAIN: ATCC 27010 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 



CCCGCGTGGC GAGCTGCCGG CACTGTACAA CGCGTTGACT GTCGAGGTCA 50 

CCCTCGAGGC AGTCGCTAAG ACCATTACCC TTGAGGTTGC CCAGCACTTG 100 

GGCGACAACC TCGTTCGCGC CGTGTCCATG GCCCCTACCG ACGGCCTCGT 150 

CCGTGGTGCT GTTGTGACCG ACTCGGGCAA GCCAATCTCC GTGCCAGTTG 2 00 

35 GCGACGTTGT TAAAGGCCAC GTTTTCAACG CACTGGGCGA TTGCTTGGAT 250 

GAGCCAGGTC TCGGCCGCGA TGGTGAGCAG TGGGGAATTC ACCGCGATCC 300 

ACCACCATTC GATCAGCTCG AAGGTAAGAC CGAAATCCTC GAGACCGGTA 350 

TTAAGGTCAT CGACTTGCTC ACCCCTTACG TTAAGGGCGG CAAGATTGGT 400 

CTGTTCGGTG GTGCAGGTGT GGGTAAGACC GTGCTCATCC AGGAGATGAT 450 

4 0 CACTCGTATT GCTCGCGAGT TCTCCGGTAC CTCCGTCTTC GCTGGCGTTG 500 

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTCGAAAT GGAAGAAATG 550 

GGCGTTCTTC AGGACACCGC TCTCGTGTTC GGCCAGATGG ACGAGCCACC 600 

AGGAGTCCGT ATGCGCGTTG CTCTGTCCGG TCTGACCATG GCGGAGTACT 650 

TCCGCGATGT TCAGCACCAG GACGTGCTTC TGTTCATCGA TAACATTTTC 700 

45 CGTTTCACCC AGGCCGGTTC CGAGGTTTCG ACCCTTCTTG GTCGTATGCC 750 

TTCCGCCGTG GGTTACCAGC CAACCTTGGC TGACGAGATG GGTGTTCTCC 8 00 

AGGAGCGTAT TACCTCTA 818 



50 

2) INFORMATION FOR SEQ ID NO: 273 

( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 833 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coirynebacterium pseudodiphtheriticum 

(B) STRAIN: ATCC 10700 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 

CGATGCCTGC TCTGTACAAC GCGCTGACTG TCGAGGTCAC CCTCGAGGCA 50 

GTCGCCAAGA CCATCACGCT TGAGGTTGCA CAGCACCTCG GCGATAACCT 100 

GATCCGGACC ATTGCGTTGG CACCTACGGA CGGTCTCGTC CGTGGCGCTG 150 

10 AGGTTATCGA CACTGGTAAG CCAATTACTG TTCCCGTCGG CGATGCCGTC 2 00 

AAAGGACACG TCTTCAATGC GCTCGGTGAG TGTTTGGACG AACCAGGATT 250 

GGGCCGCGAC GGCGAACAGT GGGGAATCCA CCGCGATCCG CCACCATTCG 300 

ATGCGCTGGA GGGCAAAACC GAGATTCTGG AGACTGGAAT CAAGGTTATC 350 

GACCTCCTTA CCCCTTACGT TAAGGGTGGC AAAATTGGTC TGTTCGGTGG 400 

15 CGCCGGCGTC GGCAAGACCG TTCTTATCCA GGAAATGATC ACTCGTATCG 450 

CTCGTAACTT CTCCGGTACT TCCGTGTTCG CCGGCGTCGG TGAGCGTACC 50 0 

CGTGAGGGTA CTGAC CTGTT CCTGGAAATG GAAGAGATGG GCGTGTTGCA 55 0 

AGACACCGCC CTTGTCTTCG GTCAAATGGA CGAACCACCA GGGGTTCGTA 600 

TGCGCGTGGC CTTGTCTGGT CTAACCATGG CTGAATATTT CCGCGACGTT 650 

2 0 CAAAACCAGG ACGTTTTGTT GTT CATTGAC AACATCTTCC GTTTTACTCA 70 0 

GGCAGGTTCC GAGGTTTCCA CGCTGTTGGG CCGTATGCCT TCCGCCGTGG 750 

GTTATCAGCC AACATTGGCT GATGAGATGG GTGTTTTGCA GGAACGGATT 800 

ACCTCTACAC GTGGTAAGTC AATTACTTCC CTG 833 



25 



2) INFORMATION FOR SEQ ID NO: 2 74 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 417 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium ulcerans 

(B) STRAIN: NCTC 8665 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 74 



10 



CCGTGCTCAT CCAGGAGATG ATCACCCGTG TGGCCCGCAA CTTCGGCGGC 50 

ACCTCTGTGT TCGCCGGCGT CGGCGAGCGC ACCCGTGAGG GCAACGACCT 100 

15 CTGGGTCGAG ATGGACGAGG CCGACGTGCT CAAGGACACC GCCCTGGTGT 150 

TCGGCCAGAT GGACGAGCCG CCGGGAACCC GTCTGCGCGT GGCCCTGTCC 2 00 

GCGCTGACCA TGGCGGAGTA CTTCCGCGAT GTGCAGAACC AGGACGTGCT 2 50 

GCTGTTCATC GACAACATCT TCCGCTTCTC CCAGGCCGGC TCCGAGGTCT 3 00 

CCACCCTGCT GGGCCGCATG CCCTCCGCGG TGGGCTACCA GCCGAACCTG 350 

50 GCGGACGAGA TGGGTGTGCT GCAGGAGCGC ATCACCTCGA CTCGCGGCCA 4 00 

CTCCATCACC TCGATGC 417 



55 2) INFORMATION FOR SEQ ID NO: 275 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

164 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium urealyticum 

(B) STRAIN: ATCC 43 042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 

10 

GGGCAGCAGC CAGCACTATT CAACGCGCTG CACGTCGAGG TTGACCTCGA 50 

GGCAGTTGCG AAGACCATTA CCCTGGAGGT CGCACAGCAC CTGGGTGACA 100 

ACCTGGTGCG CACCGTCTCC ATGGCCCCGA CCGACGGCCT GGTCCGCGGT 150 

GCAGAGGTCA AGGACACCGG TAAGCCGATC TCTGTGCCAG TCGGCGATGT 2 00 

15 TGTCAAGGGG CACGTCTTCA ACGCCCTGGG CGACTGCCTG GATGAGCCAG 250 

GTCTCGGCCG CGACGGCGAG CAGTGGGGCA TCCACCGCGA GCCACCGGCA 3 00 

TTCGACGAGC TCGAGGGTAA GAC CGAGATC CTGGAGACCG GCGTTAAGGT 350 

CATCGACCTG CTGACCCCTT ACGTCAAGGG CGGCAAGATT GGCCTCTTCG 4 00 

GTGGTGCAGG TGTGGGTAAG ACCGTCCTGA TTCAGGAGAT GATTACCCGT 4 50 

2 0 ATCGCCCGCG AGTTCTCCGG TACCTCCGTG TTCGCCGGCG TCGGCGAGCG 500 

TACCCGTGAG GGTACGGACC TCTTCCTCGA GATGGAGGAG ATGGGCGTGC 550 

TCCAGGACAC CGCGCTGGTG TTCGGTCAGA TGGATGAGCC GCCGGGAGTC 600 

CGTATGCGGG TGGCTCTGTC CGGTCTGACC ATGGCGGAGT ACTTCCGCGA 650 

TGTTCAGGGC CAGGACGTGC TGCTGTTCAT CGACAACATC TTCCGTTTCA 700 

25 CCCAGGCAGG TTCTGAGGTC TCCACGCTGC TCGGCCGCAT GCCGTCCGCA 750 

GTGGGTTACC AGCCGACCCT GGCTGACGAG ATGGGTGTTC TGCAGGAGCG 800 

CATTACCTCC ACGAAGGGTA AGTCCATTAC CTCCC 835 



30 



35 



10 



£5 



2) INFORMATION FOR SEQ ID NO: 276 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 276 



CCCGTCACGC 
AATTTAACGC 
AATTGCCATG 

50 ATACGGAAAA 
ATCATGAACG 
AAAAGAAAAA 
CTGGCGCTAC 
TGCCCCTTTG 

55 TGGAAAAACG 
ACAGCGGTTA 
AATGATTTTT 
GTTGGTGTAC 
GTTTGACGGG 

50 GTTCTGTTAT 



TGTCCCGAAA 
TCGAAGTCCA 
GGCAGCACTG 
ACCGATTGAA 
TGCTGGGTGA 
CTCCCTATTC 
CGAATTATTA 
CTAAGGGAGG 
GTTAATATGA 
TTCTGTTTTT 
ATCATGAAAT 
GGACAAATGA 
GCTTACGCTG 
TTATCGATAA 



GTGTACGATG CTTTGCAGGT 
ACAGCAACTC GGGGACGGTG 
AGGGCTTAAA ACGCGATATC 
GTTCCCGTAG GAAAAGAAAC 
GCCGATCGAT GAGTTAGGTC 
ATCGTCCTGC GCCGAGCTTT 
GAAACCGGTA TTAAAGTGGT 
CAAAGTGGGT CTTTTTGGAG 
TGGAATTAAT CCGTAATATC 
GCGGGTGTGG GAGAAAGAAC 
GAAAGAATCC AATGTCTTGG 
ACGAGCCGCC AGGGAACCGC 
GCGGAAGCCT TCCGTGACGA 
TATCTTTCGT TACACTTTGG 
165 



GGATGAAAAT 50 

TCGTGCGCAC 100 

GCCGTAAAAA 150 

TTTAGGTCGT 200 

CCATTAATTC 2 50 

ATTGAGCAAT 3 00 

CGATTTGCTT 350 

GCGCGGGCGT 4 00 

GCCATTGAAC 450 

GCGAGAAGGC 500 

ATAAAGTGGC 550 

TTGCGGGTGG 600 

AGGACGCGAC 650 

CAGGGGTTGA 700 
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AGTCTCTGCC CTCCTCGGTC GGATGCCATC GGCTGTGGGT TATCAGCCGA 750 
CGTTGGCCGA AGAGATGGGG GCCCTGCAAG AACGCATTAC TTCCACTAAA 800 
AAAGGGTCCA TTACGTCG 818 



2 ) INFORMATION FOR SEQ ID NO: 277 



(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE; Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwards i el la hoshinae 

(B) STRAIN: ATCC 33379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 



TTCCCGCAGG ACGCCGTGCC GAAGGTGTAT AACGCACTGG AAGTAAAAGG 50 

CGGTGCCACG AAACTGGTAC TGGAAGTGCA GCAGCAGCTG GGTGGCGGCG 100 

25 TAGTTCGCTG CATCGCGATG GGCTCTTCCG ACGGTCTGCG CCGTGGGCTA 150 

GAGGTTGAAG ACCAAGACCA TCCGATCGAG GTTCCTGTTG GCAAGGCGAC 200 

TCTGGGCCGT ATCATGAACG TACTGGGTGA TCCGGTCGAC ATGAAGGGCG 250 

AGATCGGTGA AGAAGAGCGT TGGGCTATCC ATCGTGCTGC ACCGAGCTAT 300 

GAAGATCTGT CTAACTCTCA GGAACTGCTG GAGACCGGCA TTAAGGTTAT 350 

3 0 CGACCTGATT TGCCCGTTCG CTAAAGGCGG TAAAGTGGGC CTGTTCGGTG 4 00 

GGGCCGGTGT GGGTAAGACC GTTAACATGA TGGAGCTTAT CCGTAACATC 450 

GCTATCGAGC ACTCCGGTTA CTCAGTCTTC GCCGGTGTGG GTGAGCGTAC 50 0 

CCGTGAGGGT AACGACTTCT ACCACGAGAT GACCGATTCC AACGTATTGG 550 

ATAAAGTTTC TCTGGTGTAT GGTCAGATGA ACGAGCCACC GGGAAACCGT 600 

35 CTGCGCGTGG CGCTGACCGG TCTGACCATG GCGGAGAAAT TCCGTGATGA 650 

AGGTCGTGAT GTACTGTTGT TCATCGATAA CATCTACCGT TAT AC CTTGG 700 

CCGGTACTGA AGTCTCCGCT CTGCTGGGCC GTATGCCGTC GGCGGTAGGT 750 

TATCAGCCGA CTCTGGCGGA GGAAATGGGG GTGCTGCAAG AGCGTATTAC 8 00 

CTCCACTAAG ACCGGGTCCA TCACCTCTG 82 9 

40 



2 ) INFORMATION FOR SEQ ID NO: 2 78 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwards i el la tarda 
55 (B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 

GCCGTGCCGA AGGTGTATAA CGCACTGGAA GTAAAAGGCG GTGCCACGAA 50 
50 ACTGGTACTG GAAGTGCAGC AGCAGCTGGG TGGCGGCGTC GTTCGCTGCA 100 

166 
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TCGCGATGGG CTCCTCCGAC GGTCTGCGCC 
CAAGACCATC CGATCGAGGT TCCGGTCGGT 
CATGAACGTA CTGGGTGATC CGGTCGACAT 
AAGAGCGTTG GGCTATCCAC CGCGCGGCGC 
5 AACTCTCAGG AGCTGCTGGA GACCGGCATC 
CCCGTTCGCC AAAGGCGGTA AAGTTGGCCT 
GTAAGACCGT TAACATGATG GAGCTTATCC 
TCCGGTTACT CCGTATTTGC CGGTGTAGGC 
CGACTTCTAC CACGAGATGA CCGACTCCAA 

10 TGGTATACGG CCAGATGAAT GAGCCGCCGG 
TTGACCGGTC TGACCATGGC GGAGAAATTC 
GTTGTTGTTC ATCGATAACA TTTATCGTTA 
TTTCTGCTCT GCTGGGTCGT ATGCCGTCGG 
CTGGCGGAAG AGATGGGTGT GTTGCAAGAG 

15 GGGCTCTAT 



PCT/CA00/01150 

GTGGGCTGGT T3GTTGKAGKC ' 3*5^ 

AAGGCGACCC TGGGCCGTAT 200 

GAAGGGCGAG ATCGGCGAAG 250 

CGAGCTATGA AGATCTGTCC 3 00 

AAGGTTATCG ACCTGATTTG 350 

GTTCGGTGGT GCCGGTGTGG 400 

GTAACATCGC TATCGAGCAC 450 

GAGCGTACCC GTGAGGGTAA 500 

CGTATTGGAT AAAGTTTCTC 550 

GAAACCGTCT GCGTGTGGCG 600 

CGTGATGAAG GTCGCGATGT 650 

TACCTTGGCT GGTACCGAAG 700 

CGGTAGGTTA TCAGCCGACC 750 

CGTATCACCT CAACGAAGAC 800 

809 
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2 ) INFORMATION FOR SEQ ID NO: 279 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23834 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279 

35 TTTCCGCGTA CCGCCATTCC GCGTGTTTAC GATGCACTCA AACTGGTTGA 50 

TACTGATTTG ACGCTGGAAG TACAGCAGCA GCTTGGTGAC GGCGTTGTCC 100 

GTACCATTGC GATGGGTAGT ACAGACGGTT TGAAACGTGG CTTGGCTGTG 150 

CAAAATACTG GTGCACCGAT TACTGTGCCG GTGGGGAAAG CCACCCTAGG 200 

CCGCATCATG GACGTTCTGG GCAACCCCGT GGACGAACAA GGTCCGATCG 250 

40 GTTCTGACCA AACCCGTGCT ATTCATCAAT TTGCACCTAA GTTCGACGAA 300 

CTCTCCAGCA CTACCGAATT GTTGGAAACA GGCATTAAAG TGATCGATTT 350 

GCTTTGTCCG TTTGCTAAAG GTGGTAAAGT GGGTCTGTTT GGCGGTGCCG 4 00 

GTGTGGGCAA AACCGTGAAC ATGATGGAGC TGATTAACAA CATTGCCAAA 450 

GCGCACAGTG GTCTTTCCGT ATTCGCCGGT GTGGGTGAGC GTACTCGTGA 500 

45 AGGTAACGAC TTCTACCACG AAATGAAAGA CTCCAACGTG TTGGATAAAG 550 

TGGCAATGGT GTATGGCCAG ATGAACGAGC CGCCTGGTAA CCGCTTGCGT 600 

GTTGCTCTAA CTGGTTTGTC GATGGCTGAA TACTTCCGTG ACGAAAAAGA 650 

CGAAAACGGC AAAGGCCGTG ACGTATTATT CTTTGTGGAT AATATCTACC 700 

GCTATACTCT GGCGGGTACC GAAGTGTCGG CTCTGCTTGG CCGTATGCCT 750 

50 TCTGCTGTGG GTTATCAGCC AACTTTGGCT GAAGAAATGG GTCGTTTGCA 800 

GGAGCGTATT ACCTCCACCC AAACTGGTTC CATTACCTCT 840 



55 2) INFORMATION FOR SEQ ID NO: 280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter agglomerans 

(B) STRAIN: ATCC 27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 



GCCGTACCAC GAGTGTACGA TGCACTTGAG GTAAAGAATG GTGAAGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AACAGCTCGG CGGTGGCGTT GTACGTACCA 100 

TCGCAATGGG TTCTTCTGAT GGTCTGCGTC GTGGTCTGGA AGTAACGGAC 150 

CTGGCTCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 2 00 

15 CATGAACGTA CTGGGCGAAC CAGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCGATCCAC CGTGCAGCAC CGTCCTACGA AGAGTTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

20 TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAC AAAGTATCCC 550 

TGGTGTATGG TCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAGTTC CGTGAC GAAG GTCGTGACGT 650 

ACTGTTGTTC GTTGACAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

2 5 TATCCGCACT GCTGGGCCGT ATGCCTTCTG CGGTAGGTTA TCAGCCGACG 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGG 803 



30 



2) INFORMATION FOR SEQ ID NO: 281 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter amnigenus 

(B) STRAIN: ATCC 33072 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTAC 50 

AGAATGGTAA CGAGAGTCTG GTGCTGGAAG TTCAGCAGCA GCTCGGTGGT 100 

GGTATCGTAC GTACTATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

50 TCTGGCTGTT AAAGATCTCG AACACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CAACACTGGG TCGTATCATG AACGTTTTGG GTCAACCAAT CGACATGAAA 250 
GGCGACATCG GTGAAGAAGA CCGTTGGGCA ATCCACCGTG CAGCACCTTC * 3 00 

CTATGAAGAG CTGTCTAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TTATCGACCT GATGTGTCCG TTCGCTAAGG GCGGTAAAGT TGGTCTGTTC 4 00 

55 GGCGGTGCGG GCGTGGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA TTCCAACGTT 550 

CTGGATAAAG TATCCCTGGT TTATGGCCAG ATGAACGAGC CACCAGGAAA 600 

CCGTCTGCGC GTTGCGCTGA CCGGTCTGAC TATGGCTGAG AAGTTCCGTG 650 

50 ACGAAGGTCG TGACGTACTG CTGTTCGTAG ATAACATCTA CCGTTACACC 700 

168 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

CTGGCCGGTA CTGAAGTATC TGCGCTGCTG GGCCGTATGC CTTCAGCGGT 750 

AGGTTACCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 800 

TCACTTCTAC CAAAACCGGT TCTATCACCT CCG 833 



2) INFORMATION FOR SEQ ID NO: 2 82 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter asJburiae 

(B) STRAIN: ATCC 35953 



20 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 



GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

25 TCGCGATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTCAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

30 TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTTG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGC CACCAG GAAACCGTCT GCGCGTTGCG 600 

35 CTGACCGGTC TGACGATGGC TGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 80 0 

CGGTTCTATC 810 

40 

2 ) INFORMATION FOR SEQ ID NO: 2 83 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ent erobacter cancerogenus 
55 (B) STRAIN: ATCC 35317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTACAGAATG GTAACGAGAG 50 
60 CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACTA 100 

169 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTAAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 20 0 

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCC 300 

5 AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTATT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 50 0 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

10 TGGTGTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACCGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CAACCAAAAC 8 00 

15 CGGTTCTATC A 811 



20 



2 ) INFORMATION FOR SEQ ID NO: 284 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13 047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

35 GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 2 00 

CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

£0 AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTATTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

15 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGTC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 70 0 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

50 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTAC C AAAAC 800 

CGGTTCTATC ACTTCCG 817 



55 2) INFORMATION FOR SEQ ID NO: 285 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 766 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

170 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



15 



20 



25 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 



AATGAGAGCC 
GCGTACCATC 
TTAAAGATCT 
GGCCGTATTA 
CGGCGAAGAA 
AGCTCTCCAG 
CTGATTTGCC 
GGGCGTTGGT 
TCGAGCACTC 
GAGGGTAACG 
AGTATCCCTG 
GCGTGGCGCT 
CGTGACGTTC 
TACTGAAGTA 
AGCCGACGCT 
ACCAAAACCG 



TGGTGCTGGA 
GCAATGGGTT 
CGAACATCCG 
TGAACGTGCT 
GAGCGTTGGG 
CTCTCAGGAA 
CGTTCGCGAA 
AAAACCGTAA 
CGGCTACTCC 
ACTTCTACCA 
GTGTACGGCC 
GACCGGTCTG 
TGCTGTTCGT 
TCCGCACTGC 
GGCGGAAGAG 
GTTCTA 



AGTTCAGCAG 
CTTCTGACGG 
ATCGAAGTCC 
GGGTCAGCCG 
CGATCCACCG 
CTGCTGGAAA 
GGGCGGTAAA 
ACATGATGGA 
GTGTTTGCGG 
CGAAATGACC 
AGATGAACGA 
ACCATGGCTG 
CGATAACATC 
TGGGCCGTAT 
ATGGGTGTTC 



CAGCTCGGCG 
TCTGCGTCGC 
CGGTAGGTAA 
GTTGATATGA 
CGCTGCGCCG 
CCGGTATCAA 
GTCGGTCTGT 
GCTGATCCGT 
GCGTGGGTGA 
GACTCCAACG 
GCCGCCGGGA 
AGAAATTCCG 
TACCGCTATA 
GCCTTCTGCA 
TGCAGGAACG 



GCGGTATCGT 50 

GGTCTGGAAG 100 

AGCGACCCTC 150 

AAGGCGATAT 200 

TCCTATGAAG 250 

GGTAATGGAC 300 

TCGGCGGTGC 350 

AACATCGCGA 400 

ACGTACTCGT 450 

TTATCGACAA 500 

AACCGTCTGC 550 

TGACGAAGGT 600 

CCCTCGCCGG 650 

GTAGGTTACC 700 

TATCACCTCC 750 

766 



30 2 ) INFORMATION FOR SEQ ID NO: 286 



35 



iO 



35 



50 



55 



50 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 

(B) STRAIN: ATCC 49162 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 



GCCGTACCAC 
CCTGGTGCTG 
TCGCCATGGG 
CTTGAGCACC 
CATGAACGTA 
AAGAGCGTTG 
AGCTCTCAGG 
TCCGTTTGCG 
GTAAAACCGT 
TCCGGTTACT 
CGACTTCTAC 
TGGTTTACGG 
CTGACTGGCC 
TCTGCTGTTC 
TATCTGCACT 



GCGTGTACGA 
GAAGTTCAGC 
TTCTTCCGAC 
CGATCGAAGT 
TTGGGTCAGC 
GGCTATCCAC 
AACTGCTGGA 
AAGGGCGGTA 
AAACATGATG 
CCGTGTTTGC 
CACGAAATGA 
CCAGATGAAC 
TGACGATGGC 
GTCGATAACA 
GCTGGGTCGT 



CGCGCTTGAG GTACAGAATG 
AGCAGCTCGG CGGCGGTATC 
GGTCTGCGTC GTGGTCTGGA 
CCCGGTAGGT AAAGCAACGC 
CAATCGACAT GAAAGGCGAC 
CGCGCGGCAC CTTCCTACGA 
AACCGGCATC AAAGTTATCG 
AAGTTGGTCT GTTCGGTGGT 
GAGCTGATCC GTAACATCGC 
GGGCGTGGGT GAACGTACTC 
CCGACTCCAA CGTTCTGGAC 
GAGCCACCAG GAAACCGTCT 
TGAGAAGTTC CGTGACGAAG 
TCTACCGTTA CACCCTGGCC 
ATGCCTTCAG CGGTAGGTTA 
171 



GTAACGAGAG 50 

GTGCGTACCA 100 

AGTGAAAGAC 150 

TGGGTCGTAT 2 00 

ATCGGTGAAG 250 

AGAGCTGTCC 300 

ACCTGATGTG 350 

GCGGGTGTAG 4 00 

GATCGAGCAC 450 

GTGAGGGTAA 500 

AAAGTATCCC 550 

GCGCGTTGCG 600 

GCCGTGACGT 650 

GGTACGGAAG 700 

TCAGCCAACG 750 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

CTTGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT " CGACCAAAAC 800 
CGGTT 805 



2) INFORMATION FOR SEQ ID NO: 2 87 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 791 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ente:rojbacte:r sakasakii 

(B) STRAIN: ATCC 2 9544 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 

TACGACGCCC TTGAGGTAAC GAATGGTAAT GAGCGTCTGG TGCTGGAAGT 50 

CCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCG ATGGGCTCTT 100 

CCGACGGTCT GCGTCGCGGT CTGCCTGTTG CAGACCTTGA GCACCCGATC 150 

25 GAAGTGCCGG TAGGTAAAGC GACGCTGGGT CGTATCATGA ACGTCCTGGG 200 

TCAGCCTATC GACATGAAAG GCGACATCGG CGAAGAAGAG CGTTGGGCGA 250 

TTCATCGCGC GGCGCCGTCC TATGAAGAGC TGTCCAGCTC TCAGGAACTG 300 

CTGGAAACCG GC AT C AAAGT TATCGACCTG ATGTGTCCGT TCGCGAAGGG 3 50 

CGGTAAAGTC GGTCTGTTCG GTGGTGCAGG TGTAGGTAAA ACCGTAAACA 40 0 

30 TGATGGAGCT TATTCGTAAC ATCGCGATTG AGCACTCCGG TTACTCCGTG 450 

TTTGCGGGCG TGGGCGAACG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

AATGACCGAC TCCAACGTAC TGGATAAAGT ATCCCTGGTG TACGGCCAGA 550 

TGAACGAGCC GCCGGGAAAC CGTCTGCGCG TTGCGCTGAC CGGCCTGACC 600 

ATGGCTGAGA AATTCCGTGA CGAAGGTCGT GACGTTCTGC TGTTCGTCGA 650 

35 CAACATCTAC CGTTACACCC TGGCCGGTAC TGAAGTATCC GCACTGCTGG 700 

GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG 750 

GGTGTTCTGC AGGAGCGTAT CACCTCCACC AAAACCGGTT C 791 



40 



50 



2) INFORMATION FOR SEQ ID NO: 288 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 839 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288 

TTTCTTTAGA TCAATCCTTA CCAGACATCA ACAATGCGTT GATTGTTTAC 50 

AAAAAAGATA AAACAAAAGT TGTTCTTGAA GTTGCTTTGG AACTTGGTGA 100 

TGGTGTTATC CGCACAATCG CTATGGAGGC TACTGATGGA TTGCAACGTG 150 

50 GAATGGAAGT TGTCGATACT GGCAAATCAA TCTCCGTTCC TGTAGGTAAA 200 

172 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

GATACTCTAG GTCGTGTGTT TAACGTATTA GGTGAAACGA TTGATAAAGA 250 

AGCACCTTTT CCAGAAGATG CAGAAAGAAG CGGCATTCAT AAAAAGGCGC 300 

CTGCTTTTGA AGACCTTAGT ACAAGTAACG AGATTTTGGA AACAGGGATC 350 

AAGGTTATCG ACTTATTAGC CCCTTATTTA AAAGGTGGGA AAGTCGGACT 4 00 

5 ATTCGGTGGT GCCGGTGTTG GTAAAACCGT TTTGATCCAA GAATTAATTC 450 

ATAATATCGC CCAAGAACAC GGTGGTATTT CAGTGTTTAC CGGTGTTGGG 500 

GAACGTACTC GTGAAGGGAA CGAC CTTTAT TATGAAATGA AAGACTCTGG 550 

CGTTATTGAG AAAACAGCCA TGGTGTTCGG ACAAATGAAC GAGCCGCCTG • 600 

GTGCACGTAT GCGTGTTGCC TTGACTGGTT TGACATTAGC TGAATATTTC 650 

10 CGTGATGAAG AAGGACAAGA TGTGTTGCTA TTTATTGACA ACATCTTCCG 700 

CTTTACTCAA GCCGGATCAG AAGTTTCTGC CTTATTAGGA CGTATGCCAT 750 

CAGCCGTTGG GTATCAACCA ACTTTGGCAA CTGAAATGGG GCAATTACAA 800 

GAACGAATCA CTTCAACCAA AAAAGGTTCG ATCACTTCA 83 9 



15 



30 



2) INFORMATION FOR SEQ ID NO : 2 89 



( i ) SEQUENCE CHARACTER I STICS : 
20 (A) LENGTH: 847 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 



TTCTCTCTAG ACCAATCATT ACCAGATATC AACAATGCGT TGATTGTTTA 50 

CAAAAAAGAT GAGCAGAAAA CAAAAGTTGT GTTAGAAGCT GCCTTAGAAC 100 

35 TTGGCGACGG CGTTATCCGT ACGATTGCCA TGGAATCAAC AGATGGCTTA 150 

CAACGAGGAA TGGAAGTAAT CGATACAGGC GCCTCCATTT CTGTTCCAGT 2 00 

TGGGACAGAA ACCTTAGGAC GGGTGTTTAA TGTCTTAGGG GACACCATCG 250 

ATTTAGAAGC GCCGTTCCCT GAAGAAGCAC CCCGCAGTGG GATTCACAAA 3 00 

AAAGCACCTG ACTTTGATGA ATTGTCAACA AGTACGGAGA TCCTTGAAAC 3 50 

4 0 TGGGATCAAA GTTATCGATT TGTTAGCCCC TTATTTAAAA GGGGGGAAAG 4 00 

TTGGACTTTT CGGTGGTGCC GGTGTTGGTA AAACCGTCTT GATCCAAGAA 450 

TTGATCCACA ACATCGCCCA AGAGCATGGT GGGATCTCTG TCTTCACAGG 50 0 

TGTTGGTGAA CGGACACGTG AAGGAAATGA CCTTTATAAT GAAATGAAAG 550 

AATCTGGCGT TATCGAAAAA ACAGCCATGG TGTTTGGACA AATGAACGAA 600 

15 CCACCAGGTG CTCGGATGCG GGTAGCCTTG ACTGGTTTGA CATTAGCCGA 6 50 

GTACTTCCGT GATGTGGAAG GACAAGACGT GCTCTTGTTT ATCGATAATA 700 

TCTTCCGCTT CACTCAAGCA GGTTCTGAAG TATCTGCCTT ACTAGGTCGG 750 

ATGCCGTCTG CCGTTGGGTA TCAGCCAACA TTAGCAACTG AGATGGGGCA 800 

ATTACAAGAA CGGATCACAT CGACGAAGAA AGGTTCCGTT ACGTCTA 847 

50 



2) INFORMATION FOR SEQ ID NO: 290 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 



173 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
5 (B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 

TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCTTT AGTTGTTTAT 50 

10 AAAAATGATG AGAAGAAATC GAAAGTTGTT CTTGAAACAG CGCTAGAATT 100 

AGGTGACGGT GTCATCCGTA CAATCGCGAT GGAATCAACA GATGGTTTAC 150 

AACG CGGAAT GGAAGTCATT GATACAGAAA AAGCAATTTC TGTACCAGTG 20 0 

GGTAAAGAAA CGTTAGGTCG TGTATTCAAT GTATTAGGAG ATACGATCGA 250 

TTTATCTGCA CCTTTCCCAG AAGATGCAAA ACGTAGCGAA ATCCATAAAA 30 0 

15 AAGCACCAAA CTTTGATGAG TTAAGTACAA GTACTGAGAT CCTTGAAACT 350 

GGGATCAAAG TTATTGACTT GCTTGCTCCT TACTTAAAAG GTGGGAAAGT 400 

TGGATTATTC GGTGGTGCCG GTGTAGGTAA AACTGTATTG ATCCAAGAAT 450 

TGATCCATAA TATCGCTCAA GAACACGGTG GTATTTCTGT ATTTACTGGT 500 

GTTGGTGAAC GTACACGTGA AGGTAATGAC CTTTATTATG AAATGAAAGA 550 

20 TTCAGGAGTT ATTGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC 60 0 

CACCAGGTGC ACGTATGCGT GTTGCCTTGA CTGGTTTGAC GATTGCTGAA 65 0 

TACTTCCGTG ATGTTGAAGG GCAAGACGTG CTATTGTTTA TTGATAATAT 700 

TTTCCGTTTC ACTCAAGCCG GTTCAGAAGT TTCTGCCCTA TTAGGTCGTA 750 

TGCCTTCTGC CGTTGGGTAC CAACCAACGC TAGCAACAGA AATGGGTCAA 800 

25 TTACAAGAAC GGATCACTTC AACGAAAAAA GGTTCAATCA CTTCA 84 5 



30 



2) INFORMATION FOR SEQ ID NO: 291 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
£0 (A) ORGANISM: Entero coccus faecalis 

(B) STRAIN: ATCC 29212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291 

15 TTAGATCAAT CCTTACCCGA TATTAACAAC GCTTTAGTCG TTTATAAAAA 50 

TGGCGAAGCA AAACAAAAAG TAGTACTTGA AGTCGCTTTA GAACTAGGTG 100 

ATGGAGTGAT TCGTTCTATC GCCATGGAAT CGACAGATGG TTTACAACGT 150 

GGAATGGAAG TTATCGATAC AGGAAAATCA ATTTCAGTTC CTGTTGGTAA 2 00 

AGATACATTA GGTCGTGTGT TTAACGTTTT AGGAGACACA ATTGACTTAG 250 

50 AAGCGCCATT CCCTGCAGAT GCTGAACGTA GTGGGATTCA TAAAAAAGCG 3 00 

CCAGCATTTG ATGAATTAAG TACCAGTAAT GAAATTTTAG AAACAGGGAT 350 

TAAAGTTATT GACTTATTAG CACCTTATCT AAAAGGTGGT AAAGTCGGAC 4 00 

TTTTCGGTGG TGCCGGTGTT GGTAAAACCG TCTTAATTCA AGAATTAATT 450 

CATAATATTG CCCAAGAACA TGGAGGGATT TCCGTCTTTA CTGGTGTTGG 5 00 

55 TGAACGGACA CGTGAAGGGA ACGATCTGTA CTATGAAATG AAAGATTCAG 550 

GCGTTATTGA AAAAACAGCC ATGGTTTTTG GTCAAATGAA CGAACCGCCA 600 

GGTGCACGGA TGCGTGTGGC CTTAACTGGG TTAACGATTG CTGAATATTT 650 

CCGTGATGTG GAAGGACAAG ACGTGCTATT ATTTATTGAT AACATTTTCC 700 

GTTTCACCCA AGCCGGTTCA GAAGTTTCTG CCCTTTTAGG TCGGATGCCG 750 

50 TCAGCCGTTG GTTACCAACC AACCTTAGCG ACTGAAATGG GACAATTACA 800 

174 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

AGAACGGATT ACTTCAACGA AAAAAGGATC AATTACCTCT 840 



5 2) INFORMATION FOR SEQ ID NO: 292 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 



TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCATT AGTTGTTTAT 50 

AAAAATGACG AAAATAAATC AAAAGTTGTT CTTGAAGCAG CGTTAGAATT 100 

AGGGGACGGA GTGATCCGGA CCATTGCGAT GGAATCAACA GATGGTTTAC 150 

AAAGAGGAAT GGAAGTCATT GATACAGGCA AAGCAATCTC TGTTCCTGTA 2 00 

25 GGTAAAGAAA CATTAGGTCG CGTATTCAAC GTACTAGGAG ATACGATCGA 2 50 

TTTAGAAACA CCTTTCCCAG AAGATGCGGA AAGAAGCGAA ATTCATAAAA 3 00 

AAGCACCAGC CTTTGACGAA TTAAGTACAA GTACAGAAAT TTTGGAAACA 3 50 

GGGATCAAAG TTATCGATTT GCTTGCCCCA TATTTAAAAG GTGGGAAAGT 4 00 

CGGACTATTC GGTGGTGCCG GTGTTGGTAA AACCGTACTG ATC CAAGAAC 450 

3 0 TGATCCATAA TATCGCCCAA GAACATGGTG GTATTTCTGT ATTTACCGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGAAAGA 550 

TTCAGGAGTT ATCGAAAAAA CAGCCATGGT GTTCGGACAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCTTTGA CTGGTTTGAC GATTGCGGAA 650 

TATTTC CGTG ATGTAGAAGG TCAAGATGTA CTGTTGTTTA TCGACAACAT 70 0 

35 TTTCCGTTTC ACTCAAGCTG GATCTGAAGT ATCAGCCTTG TTAGGACGGA 750 

TGCCTTCTGC GGTTGGTTAT CAACCAACAT TGGCAACAGA AATGGGTCAA 800 

TTGCAAGAAC GTATCACATC TACGAAAAAA G 831 



40 



50 



2) INFORMATION FOR SEQ ID NO: 293 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 6 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293 

CCTTAC CAGA TATCAACAAT GCGTTGATCG TTTACAAAAA AGACGAGAAA 50 

AAAACAAAAG TAGTATTGGA AGCCGCTTTA GAACTAGGGG ATGGTGTGAT 100 

CCGCACCATC GCAATGGAAT CTACAGACGG TTTGCAACGA GGAATGGAAG 150 

60 TCATCGATAC CGGTGCCTCA ATCTCTGTCC CTGTAGGAAC AGATACTCTA 200 

\ 175 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GGCCGAGTAT TTAATGTACT AGGCGATACT ATCGACTTGG AAGCACCAfy " 256" 

CCCAGAAGAT GCCAAACGTA GTGGCATCCA CAAAAAAGCC CCAGATTTCG 300 

ATGAATTGTC AACAAGTACA GAAATCCTTG AAACTGGGAT CAAAGTTATC 350 

GATTTATTAG CTCCTTACTT AAAAGGTGGT AAAGTCGGCT TGTTCGGTGG 4 00 

5 TGCCGGTGTT GGTAAAACCG TATTGATTCA AGAATTGATT CACAATATCG 450 

CTCAAGAGCA TGGGGGAATT TCAGTATTTA CCGGTGTTGG CGAACGGACG 500 

CGTGAAGGTA ATGACTTGTA TTATGAAATG AAAGAATCAG GCGTTATCGA 550 

AAAGACAGCC ATGGTTTTCG GTCAAATGAA TGAACCACCA GGTGCCCGGA 600 

TGCGGGTTGC TTTGACTGGT TTGACCATTG CTGAGTATTT CCGTGACGTT 650 

10 GAAGGACAAG ATGTGCTCTT GTTTATCGAT AATATTTTCC GTTTCACACA 700 

AGCGGGTTCT GAAGTATCTG CCTTGTTAGG CCGGATGCCA TCAGCCGTTG 750 

GTTATCAACC AACTCTAGCA ACTGAAATGG GTCAATTACA AGAACGAATC 800 

ACTTCTACGA AAAAAGGATC TGTAAC 826 



15 



2) INFORMATION FOR SEQ ID NO: 294 



(i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43076 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294 



TTTCTTTGGA CCAATCCTTA CCAGACATCA ACAATGCGTT GGTGGTCTAT 50 

AAAAAGAATG ATGAAAAAAC AAAAGTGGTA CTTGAAACAG CTTTAGAACT 100 

35 TGGTGATGGT GTCGTACGTA CGATTGCTAT GTCGTCAACA GACGGTTTGC 150 

AACGTGGGAT GGAAGTCATC GATACAGGAG CATCAATTTC TGTTCCTGTT 200 

GGGAAAGAGA CATTAGGACG TGTATTTAAC GTTTTAGGGG AGACTATCGA 250 

CTTAGATGGT CCATTCCCAG AAGAAGTAGC ACGAGATGGT ATTCATAAAA 300 

AGGCACCTGA TTTTGATGAA TTAAGTACAA GTACGGAGAT TCTTGAAACA 350 

40 GGGATTAAAG TAATCGATTT ATTAGCGCCT TACTTAAAAG GTGGGAAAGT 400 

TGGTTTATTC GGTGGTGCCG GTGTAGGTAA AACGGTATTA ATTCAAGAAT 450 

TGATTAACAA TATTGCGCAA GAACATGGTG GTATTTCAGT ATTTGCGGGT 500 

GTTGGTGAGC GTACTCGTGA AGGAAATGAC CTTTATTATG AAATGAAAGA 550 

GTCGGGCGTT ATTGAGAAAA CAGCGATGGT TTTTGGACAA ATGAACGAAC 600 

15 CACCAGGTGC ACGTATGCGA GTTGCTTTAA CTGGTTTAAC CATTGCAGAA 650 

TACTTCCGTG ATGTTGAAGG ACAAGATGTA TTACTATTTA TTGATAACAT 700 

TTTCCGTTTT ACTCAAGCTG GTTCAGAAGT TTCAGCTTTA TTAGGACGTA 750 

TGCCTTCAGC GGTAGGGTAT CAACCGACAT TAGCAACAGA AATGGGACAA 800 

TTACAAGAAC GTATTACGTC AACGAAAAAA GGCTCAATTA CATCAA 84 6 

50 



2 ) INFORMATION FOR SEQ ID NO: 295 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

176 
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(ii) MOLECULE TYPE; Genomic DMA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia fergusonii 
5 (B) STRAIN: ATCC 3 5469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTGCAAAATG GTAATGAGCG 50 

10 TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGTGGTATC GTGCGTACCA 100 

TCGCAATGGG GTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 200 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 3 00 

15 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACTGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGACTCCAA* CGTTATCGAC AAAGTATCCC 550 

2 0 TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGACAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 70 0 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTTCAGGAA CGTATCACCT CCACCAAAAC 800 

25 TGG 803 



30 



2) INFORMATION FOR SEQ ID NO: 2 96 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296 

45 GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA ATGGTGATGA 50 

GCGTCTGGTG CTGGAAGTGC AGCAGCAGCT CGGCGGCGGT ATCGTGCGTA 10 0 

CCATCGCAAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GACTGTCGTC 150 

GACCTCGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCGA CCCTGGGCCG 200 

TATCATGAAC GTGCTGGGTC AGCCGATCGA CATGAAAGGC GATATCGGTG 250 

50 AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCGTCCTA TGAAGAGCTG 300 

TCCAGCTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 50 0 

55 TAACGACTTC TACCATGAAA TGACCGACTC CAACGTTCTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAACCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGTTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACTG 7 00 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CTGCGGTAGG TTACCAGCCG 750 

50 ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAGCGTATCA CCTCCACCAA 800 

177 
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5 2) INFORMATION FOR SEQ ID NO: 2 97 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 80 8 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33821 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 



CCGAACGTGT ACGACGCCCT CGAAGTGACA AATGGTAATG AGCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TCGGCGGCGG TATCGTACGT ACCATCGCTA 100 

TGGGTTCTTC CGACGGTCTG CGTCGTGGTC TGGAAGTTCA GGACCTCGAG 150 

CACCCGATCG AAGTGC CGGT AGGTAAAGCG ACCCTGGGTC GTATCATGAA 200 

25 CGTACTGGGT CAGCCGATCG ATATGAAAGG CGACATCGGT GAAGAAGAGC 250 

GTTGGGCTAT TCACCGTGCA GCACCGTCCT ATGAAGAGCT CTCCAGCTCT 300 

CAGGAACTGC TGGAAACCGG CATCAAGGTT ATCGACCTGA TGTGTCCGTT 3 50 

CGCCAAGGGC GGTAAAGTCG GCCTGTTCGG CGGCGCGGGC GTGGGTAAAA 400 

CCGTAAACAT GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT 450 

3 0 TACTCCGTGT TTGCAGGCGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 500 

CTACCACGAG ATGACCGACT CCAACGTTCT GGACAAAGTA TCCCTGGTGT 550 

ACGGCCAGAT GAACGAGCCG CCGGGAAACC GTCTGCGCGT GGCACTGACC 600 

GGCCTGACCA TGGCTGAGAA GTTCCGTGAC GAAGGTCGTG ACGTTCTGCT 650 

GTTCGTTGAC AACATCTATC GTTACACCCT GGCCGGTACG GAAGTATCTG 700 

35 CACTGCTGGG CCGTATGCCT TCAGCGGTAG GTTACCAGCC GACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAGCGTATC ACCTCCACCA AAACCGGTTC 800 

TATCACCT 808 



40 



50 



2 ) INFORMATION FOR SEQ ID NO : 298 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 843 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eubacterium 1 en turn 

(B) STRAIN: ATCC 43055 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298 

TTTCCCCCTG ATCAGCTGCC GGCGATTTAC AACGCGCTGA CGGTTGATGC 50 

CAAGACCCTG GCGGGCGACT TGCACCTCGT GCTCGAGGTC GAGACGCACC 100 

TGCCGGGCAA CCTTGTCCGC TCGGTGGCCA TGAGCTCGAC GGACGGTCTC 150 

SO GTCCGCGGCC TCGAGGTCGT CGACACGGGC AACCCGATCA TGATGCCCGT 200 

178 
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GGGTCCCGAG ACCCTGGGTC GCATCTGGAA 
ACGAGAAGCC GATGCCCGAG GTGAAGGGCT 
GCTCCGGACT ACGACGAGCT GTCCACCACC 
CATCAAGGCC ATCGACCTCG TCGAGCCCTT 
5 GTCTGTTCGG CGGCGCCGGC GTGGGCAAGA 
ATCAACAACC TGGCCCAGGA GCACGGCGGC 
GGGCGAGCGT ACCCGCGAGG GTACCGACCT 
CGGGCGTCAT CAACAAGACC TGCCTCGTGT 
CCGGGAGCGC GTCTGCGCGT GGGTCTCGCG 
10 CTTCCGCGAT CAGGGCCAGG ACGTGCTTCT 
GCTTCACGCA GGCCGGCTCC GAGGTGTCCG 
TCTGCCGTGG GTTACCAGCC GACGCTGGCA 
GGAGCGCATC ACGTCGACGT CCACCGGCTC 

15 



PCT/CA00/01150 

CGTCATGGGC GAGCCCGTCG 250 

ACATGCCCAT CCACCGTCCG 300 

ACCGAGATCT TCGAGACCGG 350 

CGTCAAGGGC GGCAAGACGG 40 0 

CGGTTATCAT CCAGGAGCTC 450 

ACGTCGGTGT TCACGGGCGT 500 

CTACCTGGAG ATGAGCGACT 550 

ACGGTCAGAT GAACGAGCCT 600 

GGCCTCACCG AGGCGGAGTA 650 

GTTCGTGGAC AACATCTTCC 700 

CTCTGCTGGG CCGCATGCCC 750 

ACCGAGATGG GCGACCTGCA 800 

CATCACGTCC GTG 843 



2) INFORMATION FOR SEQ ID NO: 299 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS : Double 
(D) TOPOLOGY : Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ewingella americana. 

(B) STRAIN: ATCC 33852 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299 



TCCCTCAGGA TGCAGTACCG AACGTGTACA ATGCTCTTGA GGTAGAAAAC 50 

GGTGCCTCCA AACTGGTTCT GGAAGTTCAG CAACAGTTAG GCGGCGGCGT 100 

3 5 TGTTCGTTGT ATCGCAATGG GTACCTCAGA CGGCCTTCGT CGCGGTCTGA 150 

AAGTGAACAA CCTGGAACAC CCAATTGAAG TTCCGGTTGG TAAAGCGACT 2 00 

CTGGGTCGTA TCATGAACGT ATTGGGTGAA CCAATCGACA TGAAAGGTGA 2 50 

AATCGGCGAA GAAGAACGTC GTGCAATTCA CCGTCCAGCG CCTTCTTATG 300 

AAGAGCTGGC TAACTCCCAA GAATTGCTGG AAACCGGTAT CAAAGTTATG 3 50 

40 GACCTGATGT GTCCGTTCGC TAAGGGCGGT AAAGTCGGTC TGTTCGGTGG 400 

TGCGGGTGTT GGTAAAACTG TAAACATGAT GGAGCTGATC CGTAACATCG 4 50 

CGATCGAGCA CTCCGGTTAC TCAGTGTTTG CAGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACTGACTCCA ACGTTATCGA 550 

CAAAGTTTCC CTGGTCTATG GTCAGATGAA TGAGCCACCA GGTAACCGTC 600 

45 TGCGCGTTGC ACTGACCGGC CTGACCATGG CGGAGAAATT CCGTGATGAA 650 

GGTCGTGACG TACTGCTGTT CGTTGACAAC ATTTACCGTT ACACCCTGGC 700 

AGGTACCGAA GTGTCCGCAC TTCTGGGCCG TATGCCATCG GCGGTAGGTT 750 

ATCAGCCAAC GCTGGCGGAA GAGATGGGTG CTCTGCAAGA GCGTATCACC 800 

TCTACCAAAA GTGGTTCTAT CACCTCCGT 829 

50 

2) INFORMATION FOR SEQ ID NO: 300 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



179 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 
5 (B) STRAIN: LVS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 

AACACGCCTA AAGTATATGA TGCTTTAAAT GTAGTAGAAG CTGGTTTAGT 50 

10 ATTAGAAGTT CAGCAACAAA TTGGTGATGG CGTAGTTCGT ACAATTGCTA 10 0 

TGGGATCTAG TGATGGTCTT AGACGTGGTA TGGAAGTTAA GAACACAAAT 150 

GCGCCTATTT CTGTTCCAGT TGGACATGGC ACACTTGGAC GTATCATGAA 2 00 

TGTTTTAGGT GAACCAATTG ATGAAGCTGG TCCAATTGAA TATACTGAGA 250 

AAAGATCTAT C CATC AAGCT CCTCCTGCAT ATGATGAGTT AGCATTAAGT 3 00 

15 ACAGAAATAT TAGAAACAGG TATCAAAGTA GTTGACCTTA TTTGTCCATT 350 

TGCTAAGGGC GGTAAAGTTG GTTTATTTGG CGGTGCAGGT GTTGGTAAAA 400 

CTGTAACGAT GATGGAACTT ATCAACAATA TTGCAAAAGA ACATAGTGGC 450 

TACTCTGTAT TTTCCGGTGT TGGTGAAAGA ACTCGTGAAG GTAATGACTT 500 

CTACTATGAG ATGAAATATT CTAATGTATT GGATAAAGTA TCATTAGTAT 550 

2 0 ATGGTCAGAT GAATGAGCCG CCTGGAAACA GATTAAGAGT AGCTCTTAGT 600 

GGCTTAACAA TAGCAGAAGG ATTCCGTGAT GAAAAGCGTG ATGTTTTGAT 650 

GTTTATCGAT AACATCTATC GTTATACATT AGCAGGTACA GAGGTATCGG 700 

CGCTACTTGG TCGTATGCCA TCTGCTGTGG GTTATCAGCC AACGCTTGCA 750 

GCTGAGATGG GTGCTTTACA " GGAGCGTATT ACATCTACTA AGACAGGATC 800 

25 TATTA 805 



30 



2) INFORMATION FOR SEQ ID NO: 301 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 <D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Fusobacterium gonidiaformans 

(B) STRAIN: ATCC 2 5563 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 301 

45 GACGAATTGC CAAAAATATA CAATGCATTA AAGGTGCAAG TTGGAGAAAA 50 

AGAACTTGTA TTGGAAGTGC AACAACATTT GGGAAATAAT GTTGTGAGAA 100 

CAGTAGCGAT GGACTCAACA GATGGATTGC TTCGAGGAAT GGAAGTAATG 150 

GATACCGGAG CACCGATTAC TGTTCCAGTA GGGAAGGCGG TTTTAGGAAG 200 

AATATTGAAT GTTTTGGGAG AGCCTGTGGA TCAAAAAGGG CCTGTGGAAA 250 

50 CAGAAGAATA TTTACCTATC CATAGAGAAG CACCAAAATT TGAAGAACAA 3 00 

GAAACAGTAA CAGAAATTTT TGAAACAGGA ATTAAAGTCA TAGATTTGTT 350 

AGCCCCTTAT ATCAAAGGAG GAAAGACAGG TCTATTCGGT GGAGCCGGAG 400 

TAGGGAAAAC AGTTTTAATT ATGGAATTAA TTAATAACAT TGCAAAGGGC 450 

CACGGAGGAA TTTCTGTGTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500 

55 AAGAGATTTA TACAACGAAA TGACAGAGTC CGGAGTTTTG AATAAGACCT 550 

CGTTGGTGTA TGGTCAAATG AATGAGCCGC CCGGAGCAAG ACTTCGTGTG 600 

GCGTTGACAG GATTAACGGT TGCTGAAAAC TTTAGAGATA AAGAAGGGCA 650 

AGATGTATTG TTGTTTATCG ACAATATCTT CCGTTTCACA CAAGCAGGAT 700 

CAGAAGTATC GGCTCTATTG GGAAGAATTC CATCGGCAGT AGGATATCAA 750 

SO CCGAACTTAG CGACAGAAAT GGGAACTTTA CAAGAAAGAA TTACTTCTAC 800 

180 
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5 2) INFORMATION FOR SEQ ID NO: 302 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 06 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium necrophorxim subsp. 
necrophorum 

(B) STRAIN: ATCC 25286 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 

ACAATGCATT AAAGGTACAG GTGGGAGAAA GGGAACTTGT GTTGGAAGTG 50 

CAGCAACATT TAGGAAATAA TGTTGTCAGA ACAGTAGCAA TGGATTCAAC 10 0 

AGACGGATTA CTTCGGGGAA TGGAAGTGAG AGATACAGGA GTTCCCATTA 150 

25 CTGTTCCGGT AGGAAAGGCG GTTTTGGGAA GAATATTAAA TGTCTTAGGG 2 00 

GAGCCTGTGG ACGAAAAAGG TCCGATAGAG ACAGAAGAAT ATTTACCAAT 250 

ACATAGAGAA GCACCGAAAT TTGAAGAACA GGAAACGGTG . ACAGAAATTT 3 00 

TTGAAACAGG AATTAAAGTC ATTGATTTGT TAGCTCCTTA TATTAAAGGA 350 

GGAAAAACAG GCCTATTCGG AGGAGCCGGA GTAGGAAAAA CCGTTTTGAT 400 

3 0 TATGGAACTG ATCAATAATA TTGCAAAAGG TCATGGAGGA ATTTCTGTTT 4 50 

TTGCAGGAGT TGGAGAAAGA ACGAGAGAGG GAAGAGATCT ATACAACGAA 500 

ATGACAGAGT CCGGAGTTTT GAATAAAACT TCTTTGGTAT ATGGGCAAAT 550 

GAATGAGCCG CCCGGAGCAA GACTTCGAGT GGCTTTAACC GGACTTACTG 600 

TTGC CGAAAA TTTCAGAGAT AAAGAGGGAC AGGATGTCTT ATTGTTCATT 650 

3 5 GACAATATTT TCCGTTTCAC ACAAGCAGGT TCGGAAGTAT CGGCACTTTT 700 

GGGGAGAATT CCTTCTGCAG TGGGATATCA ACCGAACTTG GCGACAGAAA 750 

TGGGAAGCTT ACAAGAAAGA ATTACTTCTA CAAAATCCGG TTCTATCACT 800 

TCCGTG 806 



40 



2) INFORMATION FOR SEQ ID NO: 303 

( i ) SEQUENCE CHARACTERI STICS : 
45 (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphum 

(B) STRAIN: ATCC 10953 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 



GATGAATTGC CTGCAATATA TAATGCTTTA AAAGTAAAAT TAGAAGATAA 50 

GGAACTTGTT CTAGAAGTTG AACAACATCT TGGTAACAAT GTTGTAAGAA 100 

50 CTGTTGCTAT GGATTCAACT GATGGATTAA AAAGAGGAAT GGAAGTTATA 150 

181 
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GATACAGGTA AACCAATTAC AGTACCAGTT 
AATATTAAAT GTTTTAGGAG AACCTGTTGA 
CTGAAACATT TTTACCTATT CATAGAGAAG 
GAAACTGAAA CTGAAATATT TGAAACAGGA 
5 AGCACCATAT ATTAAAGGTG GAAAAATAGG 
TAGGAAAAAC AGTTTTAATA ATGGAACTTA 
CATGGAGGAA TTTCAGTTTT TGCAGGAGTT 
TAGAGACTTA TATGGTGAAA TGACTGAATC 
CTCTTGTTTA TGGACAAATG AATGAGCCAC 
10 GCATTAACAG GGCTTACTGT TGCAGAAAAC 
AGATGTTCTT CTATTTATAG ATAATATATT 
CAGAAGTTTC AGCTTTACTT GGAAGAATAC 
CCAAACCTAG CAACTGAAAT GGGTGCTTTA 
AAAATCTGGT TCAATTACAT C 

15 



GGTAAAGCTG TTCTTGGTAG 200 

TAATCAAGGT CCTATAAATG 250 

CACCAGAATT TGATGACTTA 300 

ATAAAAGTTA TAGACTTATT 350 

ATTATTTGGT GGAGCTGGAG 4 00 

TCAACAACAT TGCAAAAGGA 450 

GGAGAAAGAA CAAGAGAAGG 500 

AGGAGTTATC ACAAAAACAG 550 

CTGGAGCAAG ACTTAGAGTT 600 

TTTAGAGATA AAGATGGGCA 650 

TAGATTTACA CAAGCAGGTT 700 

CATCAGCTGT TGGATATCAA 750 

CAAGAAAGAA TAACATCTAC 800 

821 



2) INFORMATION FOR SEQ ID NO: 304 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 64 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 

(B) STRAIN: ATCC 49145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 04 



TTCCCAGTTG 

35 CAACACCGTT 
AAGTTGAGCA 
CCTACGGACG 
AATTTCTGTG 
CTGGTAACAT 

40 CGCTGGCCAA 
GACTCAAATG 
ACGTTCAGGG 
ACTGTGTTGA 
TGTGTCTGTG 

45 TGATTGGCGA 
TTTGGTCAGA 
TGCTTTGACT 
TGCTGTTTAT 
TCCACGTTGC 

50 GGCGGATGAA 
ATTCTATTAC 



GCTATCTTCC 
GGAAACACGG 
GCACCTTGGT 
GCTTGGTCCG 
CCTGTTGGAG 
TTTAAACGCT 
TCCACCGCAA 
TTTGAAACAG 
CGGAAAGATT 
TTCAGGAGAT 
TTTGCTGGCG 
AATGGCTGAG 
TGGATGAGCC 
ATGGCTGAGT 
CGACAACATC 
TTGGTCGTAT 
ATGGGTGCGT 
GTCG 



AGATATTTAT 
AGGGAGATAC 
GATTCAACTG 
TGGTGCTTTA 
ATGTTACAAA 
AAACCAGGCG 
CCCACCTGCT 
GCATTAAGGT 
GGTCTGTTCG 
GATTCAGCGC 
TTGGCGAACG 
GCTGGCGTTT 
TCCTGGGACT 
ATTTCCGTGA 
TTCCGCTTTA 
GCCTTCTGCA 
TGCAGGAGCG 



AATGCTCTCA 
CGTCCACGAG 
TGCGAGCAGT 
GTGCGAGATA 
AGGTCACGTT 
AAAACATTGA 
TTCGATCAGC 
TATCGATTTG 
GTGGTGCAGG 
GTTGCACAGA 
TACTCGTGAG 
TGGAGAAAAC 
CGTCTTCGTG 
TGTTCAGAAT 
CTCAGGCAGG 
GTTGGTTATC 
CATTACTTCT 



AGGTTGATAT 50 

ATTACATTGG 100 

GGCACTTAAG 150 

CTGGTGGCCC 200 

TTTGACGTAA 250 

GGTGACCGAG 3 00 

TTGAGTCTAA 350 

CTTACGCCTT 400 

CGTTGGTAAA 450 

ACCACGGCGG 500 

GGTAACGATT 550 

AGCGCTTGTC 600 

TGCCTCTTAC 650 

CAGGATGTGT 700 

TTCTGAGGTT 750 

AGCCAAACTT 800 

ACGCGCGGTC 850 

864 



55 2) INFORMATION FOR SEQ ID NO: 305 

( i ) SEQUENCE CHARACTERI STICS : 

(A) . LENGTH: 84 8 bases 

(B) TYPE: Nucleic acid 
SO (c) STRANDEDNESS: Double 

182 
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WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella. haemolysans 

(B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 

10 

TCGAATCAGG GCATATGCCA AATCTATTAA ACGCTTTAGA AGTTTACATA 50 

GAAAAAGGCG ATGGGAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATTGGTGAT AACGTAGTAA GAACAATCGC TATGTCATCT ACTGATGGAT 150 

TAAATAGGGG AGCAGAAGTA GTAGATACAG GAGCACCAAT TACAGTTCCT 2 00 

L5 GTAGGTAACT ACACATTAGG TCGTGTGTTC AACGTATTAG GTGAAGCAGT 250 

TGACCACGGT GAAGAAGCAG GAGCAGAAGT TCGTAAAGAT TCAATTCACA 300 

AAGAAGCTCC AACATTCGAT GAATTATCAA CTCACGTTGA GGTTCTTGAA 3 50 

ACAGGTATTA AAGTTATCGA CTTACTTGCA CCATATATTA AAGGTGGTAA 400 

AATCGGTCTT TTCGGTGGTG CGGGAGTTGG TAAAACGGTT CTTATCCAAG 450 

2 0 AACTTATCAA CAACGTTGCG CAACAACACG GTGGATTATC AGTATTCACA 500 

GGTGTAGGTG AGCGTACTCG TGAAGGAAAT GACTTATACT ATGAAATGAA 550 

AGATTCTGGT GTTATTAACA AAACAGCCAT GGTATTCGGA CAAATGAACG 600 

AACCACCAGG TGCTCGTATG CGTGTAGCAT TAACAGGATT AACAATGGCG 650 

GAATACTTCC GTGATGAAGA AGGACAAGAC GTGCTTCTAT TCATCGATAA 700 

25 CATTTTCCGT TTCACACAAG CAGGTTCTGA GGTTTCTGCG TTATTAGGAC 750 

GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC AGAGATGGGA 800 

CGTTTACAAG AACGTATAAC ATCAACTAAA AAAGGTTCTG TTACATCT 848 
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40 



2) INFORMATION FOR SEQ ID NO: 3 06 



(i) SEQUENCE CHARACTERISTICS: ' 
(A) LENGTH: 84 8 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella. moirbillorum 

(B) STRAIN: ATCC 27824 

15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306 

TCGAATCAGG GCATATGCCT AATCTACTAA ACGCTTTAGA AGTTTATATA 50 

GAAAAAGGCG ATGGAAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATCGGGGAT AATGTCGTAA GAACTATTGC GATGTCATCT ACTGATGGAT 150 

50 TAAACAGAGG GGCAGAAGTA GTTGATACTG GAGCGCCAAT TACAGTGCCA 2 00 

GTAGGTAACT ATACATTAGG ACGTGTGTTC AACGTATTAG GTGAAGCAGT 250 

TGACCACGGA GAAGAAGCTG GAGCAGAAGT TCAAAAAGAA TCTATTCATA 300 

AAGAAGCTCC AACTTTCGAA GAATTATCAA CACATGTTGA GGTATTAGAA 350 

ACAGGTATTA AAGTTATCGA CCTTCTTGCA CCATATATTA AAGGTGGTAA 400 

55 GATTGGACTA TTCGGTGGTG CTGGAGTTGG GAAAACAGTT CTTATCCAAG 450 

AACTTATTAA CAACGTAGCA CAACAACACG GAGGACTTTC AGTATTTACT 500 

GGGGTAGGTG AACGTACTCG TGAGGGTAAC GACTTGTACT ATGAAATGAA 550 

AGACTCTGGA GTTATTAATA AAACTGCCAT GGTATTTGGT CAAATGAATG 600 

AGCCACCAGG TGCACGTATG CGTGTTGCCT TAACAGGATT AACAATGGCA 650 

50 GAGTACTTCC GTGATGAAGA AGGACAAGAC GTACTATTAT TTATCGATAA 700 
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TATCTTCCGT TTCACACAAG CAGGGTCTGA GGTATCTGCA TTATTAGGGC 75 0 

GTATGCCTTC AGCCGTTGGA TATCAACCAA CTCTTGCAAC AGAAATGGGA 800 
CGTCTTCAAG AAGGTATTAC ATCAAC TAAA AAAGGATCTG TTACATCT 848 



2) INFORMATION FOR SEQ ID NO : 3 07 

(i) SEQUENCE CHARACTERISTICS: 
L0 (A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

L5 (ii) MOLECULE TYPE: Genomic DNA 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 



GATGCAGTAC CAAAAGTATA TGATGCTTTA AAAGTTGAAT CAGGTTTAAC 50 

CTTAGAAGTT CAACAACAAT TAGGTGGTGG TTTAGTACGT TGTATCGCAT 100 

25 TAGGTACCTC AGATGGTTTA AAGCGTAGCT TAAAGGTTGT AAATACAGGT 150 

AACCCTATTC AAGTTCCTGT AGGCACTAAA ACATTAGGCC GTATTATGAA 200 

TGTATTAGGC GAACCAATTG ATGAAAAAGG ACCTATTAGC GAAGAAGCTC 250 

GTTGGGATAT TCATCGTGCG GCTCCAAATT ATGAAGAACA GTCAAATAGT 3 00 

ACTGAATTAC TTGAAACCGG TATCAAAGTT ATTGACTTAA TTTGTCCATT 3 50 

3 0 TGCAAAAGGT GGTAAAGTCG GCTTATTTGG TGGAGCTGGT GTAGGTAAAA 4 00 

CCGTTAATAT GATGGAATTG ATCCGTAATA TTGCTATTGA GCACTCAGGT 450 

TATTCGGTTT TTGCTGGTGT AGGTGAGCGT ACTCGTGAAG GTAATGATTT 500 

TTATCATGAA ATGACGGATT CTAATGTATT AGATAAAGTA TCACTAGTAT 550 

ATGGTCAAAT GAATGAACCA CCAGGTAACC GCCTACGTGT TGCGTTAACA 60 0 

35 GGTTTAACTA TGGCTGAAAA ATTCCGTGAT GAAGGTCGTG ATGTATTATT 650 

TTTCGTAGAT AATATTTATC GTTATACTTT AGCCGGTACA GAAGTTTCTG 700 

CTTTATTAGG CCGTATGCCA TCAGCGGTAG GTTATCAACC AACCCTTGCA 750 

GAAGAAATGG GTGTATTACA AGAACGTATT ACCTCAACTA AAACTGGTTC 8 00 

AATCACGGCA GTA 813 

10 

2) INFORMATION FOR SEQ ID NO : 3 08 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 
55 (B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 

TGAATTTCCA CAAGATGCAG TGCCAAAAGT TTACGATGCA TTAAAAGTTG 50 
50 AATCAGGTTT AACACTTGAG GTGCAACAAC AATTAGGTGG CGGTGTGGTA 100 
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CGTTGTATCG CATTAGGTGC 
AGAAAACACG AATGATCCGA 
GTCGTATCAT GAATGTATTG 
GGTGAAGAAG AGCGTTGGGC 
5 ACAATCAAAC AGTACGGAAT 
TAATTTGTCC ATTCGCAAAA 
GGTGTAGGTA AAACCGTTAA 
CGAGCACTCA GGTTACTCCG 
AAGGTAACGA CTTCTATCAT 

10 GTATCTTTGG TTTATGGTCA 
TGTTGCGTTA ACTGGTTTAA 
GTGATGTATT ATTCTTCGTG 
ACGGAAGTAT CTGCGTTATT 
ACCAACTCTT GCTGAAGAAA 

L5 CCAAAACAGG TTCTATTACA 



TTCTGACGGT TTAAAACGTG 
TTCAAGTACC GGTAGGCACA 
GGTGAACCAA TTGACGAACA 
TATCCATCGT TCTGCACCAA 
TATTAGAGAC TGGTATCAAA 
GGTGGTAAAG TTGGTCTATT 
CATGATGGAA TTAATCCGTA 
TATTTGCGGG TGTAGGTGAA 
GAAATGAAAG ATTCTAACGT 
GATGAATGAG CCACCAGGTA 
CCATGGCAGA AAAATTCCGC 
GATAATATCT ATCGTTATAC 
AGGTCGTATG CCATCTGCGG 
TGGGTGTGTT ACAAGAACGT 
TCTGTA 



PCT/CA00/01150 

GTTTAAAAGT 150 

AAAACCCTTG 2 00 

AGGTCCAATC 250 

GCTATGAAGA 3 00 

GTTATCGACT 350 

CGGTGGTGCG 40 0 

ACATCGCGAT 450 

CGTACTCGTG 500 

ATTAGATAAA 550 

ACCGTTTACG 600 

GATGAAGGTC 650 

CCTTGCTGGT 700 

TAGGTTACCA 75 0 

ATCACTTCAA * 800 

826 
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2) INFORMATION FOR SEQ ID NO: 309 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



35 



40 



45 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus parahaemolyticus 
<B) STRAIN: ATCC 10014 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309 



GATGCAGTAC 
GCTTGAAGTT 
TAGGTACGTC 
AACCCAATTG 
CGTATTGGGT 
GCTGGGCAAT 
ACGGAATTAC 
CGCAAAAGGG 
CCGTAAATAT 
TACTCTGTAT 
CTACCACGAA 
ATGGTCAAAT 
GGCTTAACCA 
CTTCGTCGAT 
CACTTCTCGG 
GAAGAAATGG 
TATCACCTC 



CAAAAGTATA 
CAACAACAAT 
TGATGGTTTA 
AAGTGCCAGT 
GAGCCGATTG 
CCACCGTGCA 
TCGAAACAGG 
GGTAAAGTTG 
GATGGAGTTA 
TTGCAGGGGT 
ATGACAGACT 
GAACGAACCA 
TGGCGGAAAA 
AACATCTACC 
TCGTATGCCA 
GTGTATTACA 



TGATGCGTTA 
TAGGCGGTGG 
AAACGTGGCT 
GGGCACTAAA 
ACGAAAAAGG 
GCACCAAGCT 
TATCAAAGTT 
GTTTATTTGG 
ATCCGTAACA 
AGGTGAGCGT 
CTAACGTATT 
CCAGGTAACC 
ATTCCGCGAT 
GTTATACCCT 
TCTGCGGTAG 
AGAGCGTATC 



AAAGTTGAAT 
CTTAGTGCGC 
TAAAAGTAGA 
ACCCTTGGTC 
TCCTATCGGT 
ACGAAGAGCA 
ATCGACTTAA 
TGGTGCAGGT 
TCGCGATCGA 
ACTCGTGAAG 
AGATAAAGTA 
GTTTACGCGT 
GAAGGTCGTG 
AGCAGGTACG 
GTTATCAGCC 
ACTTCAACCA 



CAGGTTTAAC 50 

TGTATCGCAT 100 

AAATACAGGC 150 

GTATTATGAA 2 0 0 

GAAGAAGCAC 250 

ATCAAATAGC 3 00 

TCTGCCCATT 3 50 

GTAGGTAAAA 4 00 

ACACTCTGGT 450 

GTAATGACTT 500 

TCGTTAGTGT 550 

AGCTTTAACA 600 

ACGTATTATT 650 

GAAGTGTCAG 7 00 

AACCTTAGCA 750 

AAACTGGTTC 8 00 

809 



55 2) INFORMATION FOR SEQ ID NO: 310 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

185 
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10 



WO 01/23604 PCT/CA00/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Haemophilus parainfluenzas 

(B) STRAIN: ATCC 7901 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 



CGAATTTCCA CAAGATGCAG TACCAAAAGT TTATGATGCA TTAAAAGTTG 50 

AATCGGGTTT AACCCTTGAA GTTCAACAAC AATTAGGTGG TGGTGTGGTA 10 0 

CGTTGTATCG CACTGGGAGC TTCTGACGGT TTAAAACGCA GTTTAAGCGT 150 

TGAAAATACC AATAAACCAA TTTCAGTACC GGTTGGTGTA AAAACTCTCG 20 0 

15 GTCGTATTAT GAACGTATTG GGCGAACCGA TTGATGAAAG AGGTCCTATC 250 

GGTGCGGAAG AAGAATGGGC AATTCACCGT TCTACTCCAA GTTATGAAGA 3 00 

ACAGTCCAAC AGTACCGAAT TATTAGAAAC CGGTATCAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCGAAG GGTGGTAAAG TTGGTTTATT CGGTGGTGCG 400 

GGTGTAGGTA AGACCGTAAA TATGATGGAA TTAATCCGTA ATATTGCGAT 4 50 

2 0 TGAGCACTCA GGTTACTCCG TATTTGCCGG TGTAGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTACCAT GAAATGACAG AATCTAACGT ATTAGACAAA 550 

GTATCCCTAG TTTACGGACA AATGAATGAG CCGCCGGGTA ACCGTTTACG 60 0 

TGTTGCTTTA AC CGGTTTAA CCATGGCAGA AAAATTCCGT GACGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAACATCT ATCGTTATAC CCTTGCAGGG 700 

25 ACTGAAGTAT CGGCACTTTT AGGCCGTATG CCATCAGCGG TAGGTTATCA 750 

GCCGACACTT GCAGAAGAAA TGGGTGTGTT ACAAGAACGT ATT AC AT CAA 800 

CCAAAACAGG TTCTATTACT TCTG 824 



30 



40 



2) INFORMATION FOR SEQ ID NO : 311 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 811 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 13337 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 

GCCGTGCCTA AAGTGTATAA CGCACTTGAG GTGAAAGGCG GTGCCACTAA 50 

ACTGGTACTG GAAGTTCAGC AGCAGCTAGG CGGCGGCGTT GTACGCTGTA 100 

TCGCTATGGG TACTTCTGAC GGTCTGCGTC GCGGACTGGA CGTTGTTGAC 150 

50 CTGGAGCACC CGATTGAAGT CCCAGTAGGT AAAGCGACCT TAGGCCGCAT 20 0 

TATGAACGTA CTGGGTGAGC CAATTGATAT GAAGGGTGAT ATCGGCGAAG 250 

AAGATCGCTG GGCTATTCAC CGTGAAGCTC CAAGCTACGA AGAACTGTCT 3 00 

AACTCGCAAG AACTGCTGGA AACTGGTATC AAGGTAATGG ACCTGATTTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 400 

55 GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCAGGTTACT CTGTATTTGC CGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTATTGGAC AAAGTATCAC 550 

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACTATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

60 ACTGCTGTTC ATCGATAACA TCTACCGTTA TACCTTGGCC GGTACCGAAG 700 

186 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



TATCTGCACT GTTGGGTCGT ATGCCTTCTG CGGTAGGTTA TCAGCCAACG 75 0 

CTGGCGGAAG AGATGGGTGT TCTGCAAGAA CGTATCACCT CGACCAAAAC 800 
GGGTTCAATC A 811 



2) INFORMATION FOR SEQ ID NO: 312 

( i ) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella kingae 

(B) ' STRAIN: ATCC 23330 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 



GCGATGCTAT TCCACGCGTT TACGATGCGT TGAAACTGGT TGATGTGGAC 50 

TTGACATTAG AAGTGCAACA ACAACTGGGC GATGGCGTAG TGCGTACCAT 100 

25 TGCGATGGGT AGTACCGATG GTTTGAAACG CGGCTTAGCC GTGAACAACA 150 

CAGGCGCACC TATTACAGTG CCTGTTGGTA AAGCAACATT GGGTCGTATT 200 

ATGGACGTAT TGGGTAATCC TGTTGATGAA GCAGGTCCAA TTGGTTCTGA 250 

CCAAACGCGT GCTATTCACC AACCAGCTCC TAAATTTGAT GAACTGTCTA 30 0 

GCGCAACCGA ATTGCTGGAA ACAGGCATCA AAGTGATTGA CTTGCTTTGC 35 0 

30 CCATTTGCAA AAGGTGGTAA AGTAGGTTTG TTTGGTGGTG CAGGTGTGGG 400 

CAAAACTGTG AACATGATGG AGTTGATTAA CAACATTGCC AAAGCGCACA 45 0 

GTGGTTTGTC TGTATTTGCA GGCGTGGGTG AACGTACTCG CGAAGGTAAT 500 

GACTTCTATC ACGAGATGAA AGATTCTAAC GTGTTGGATA AAGTTGCCAT 550 

GGTGTATGGT CAAATGAATG AACCTCCTGG CAAC CGTTTG CGCGTTGCAT 60 0 

35 TGACTGGTTT GTCTATGGCA GAACACTTCC GTGATGAAAA AGACGAAAAT 65 0 

GGCAAAGGTC GCGATGTATT GTTCTTTGTG GACAACATCT ATCGCTACAC 70 0 

ATTGGCAGGT ACAGAAGTAT CGGCATTGCT GGGTCGTATG CCCTCTGCGG 750 

TAGGTTATCA ACCAACATTG GCAGAAGAAA TGGGTCGTTT GCAAGAGCGT 80 0 

ATTACTTCAA CGCAAACAGG TTCGATTACT T 831 



2) INFORMATION FOR SEQ ID NO: 313 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella, pneumoniae subsp. ozaenae 
55 (B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 

ATGCCGTACC ACGCGTGTAC GAAGCCCTTG AGGTACAGAA TGGTAATGAA 50 
50 GTTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGTA TCGTACGTAC 100 
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CATCGCCATG GGTTCTTCTG ATGGTCTGCG CCGCGGTCTG " GATGTAAAAG 150 

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT 2 00 

ATCATGAACG TACTGGGTCA ACCGGTTGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAGCGT TGGGCTATCC ACCGCGCGGC ACCGTCCTAT GAAGAGCTGT 300 

5 CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGAC CTGATG 3 50 

TGTCCGTTCG CCAAGGGCGG TAAAGTTGGT CTGTTCGGCG GTGCGGGTGT 4 00 

AGGTAAAACT GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ATAAAGTATC 550 

LO CCTGGTGTAC GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CGCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACTGA 700 

AGTATCCGCG CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 8 00 

L5 ACCGGTTCTA TC 812 



2) INFORMATION FOR SEQ ID NO: 314 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 
<B) TYPE: Nucleic acid 
(C) STRANDEDNESS : Double 

25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi> ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Klebsiella ornithinolytica 

(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 

IS ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAGAA TGGTAATGAG 50 

AGCCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGTGGTA TCGTACGTGC 100 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTTGAGCA CCCGATCGAA GTCCCGGTTG GTAAAGCAAC GCTGGGTCGT 2 00 

ATCATGAACG TGCTGGGTCA GCCAATCGAT ATGAAAGGCG ACATCGGCGA 250 

20 AGAAGAGCGT TGGGCTATTC ACCGTGCAGC TCCGTCCTAT GAAGAGCTGT 3 00 

CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

15 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG ATAAAGTATC 550 

CCTGGTTTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG CCGGTACTGA 700 

AGTATCCGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

50 CCCTGGCGGA AGAGATGGGT GTTCTGCAGG AACGTATCAC CTCCACCAAA 8 00 

ACCGGTTCTA TC 812 



55 2) INFORMATION FOR SEQ ID NO: 315 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

188 
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10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 



GTACCGCGCG TGTACGAGGC TCTTGAGGTA CAAAATGGTA GTGAGAATCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATTGTT CGTACCATCG 100 

CCATGGGTTC TTCCGACGGT CTGCGTCGCG GTCTGGAAGT CAAAGACCTC 150 

GAGCATCCGA TCGAAGTCCC GGTAGGTAAA GCAACGCTGG GTCGTATCAT 2 00 

L5 GAACGTACTG GGCCAACCGG TAGACATGAA AGGCGAC AT C GGCGAAGAAG 2 50 

AGCGTTGGGC GATTCACCGC GCAGCGCCTT CCTACGAAGA GTTGTCAAAC 3 00 

TCTCAGGAAC TGCTGGAAAC CGGCATCAAA GTTATCGACC TGATGTGTCC 3 50 

GTTTGCGAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 4 00 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450 

20 GGTTACTCCG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 5 00 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGATAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCGCTG 600 

ACCGGCCTGA CCATGGCTGA GAAGTTCCGT GACGAAGGTC GTGACGTTCT 650 

GCTGTTCGTC GATAACATCT ATCGTTACAC CCTGGCCGGT ACTGAAGTAT 7 00 

25 CCGCACTGCT GGGTCGTATG CCTTCAGCGG TAGGTTACCA GCCGACTCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACGGG 800 

TTCTATCACT TCC 813 
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2) INFORMATION FOR SEQ ID NO: 316 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 822 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola. 

(B) STRAIN: ATCC 33531 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTACAGA ATGGTAATGA 50 

GAGCCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGTGGT ATCGTACGTG 100 

CTATCGCCAT GGGTTCTTCT GACGGTCTGC GTCGTGGTCT GGAAGTTAAA 150 

50 GACCTTGAGC ACCCGATCGA AGTCCCGGTT GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTGCTGGGTC AGCCGATCGA TATGAAAGGC GACATCGGCG 250 

AAGAAGAGCG TTGGGCTATT CACCGCGCAG CTCCGTCTTA TGAAGAGCTG 3 00 

TCCAGTTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 3 50 

GTGTCCGTTC GCTAAGGGCG GTAAAGTAGG TCTGTTCGGT GGTGCGGGCG 4 00 

55 TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTC GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTTTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCTCTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

50 CGTTCTGCTG TTCGTCGATA ACATCTATCG TTATACCCTG GCCGGTACTG 700 
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AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 7 50 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCCACCAA 800 
AACCGGTTCT ATCACTTCCG TA 822 



2) INFORMATION FOR SEQ ID NO: 317 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 85 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 

AGAATGGTAA TGAAGTTCTG GTGCTGGAAG TTCAGCAGCA GCTGGGCGGC 50 

GGTATCGTAC GTACCATCGC CATGGGTTCT TCTGATGGTC TGCGCCGCGG 100 

TCTGGATGTA AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 150 

CAACGCTGGG TCGTATCATG AACGTACTGG GTCAACCGGT TGACATGAAA 200 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGCG CGGCACCGTC 2 50 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 3 00 

TTATCGACCT GATGTGTCCG TTCGCCAAGG GCGGTAAAGT TGGTCTGTTC 3 50 

GGCGGTGCGG GTGTAGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 4 00 

CATCGCGATC GAGCACTCCG GTTACTCTGT GTTTGCGGGC GTAGGTGAGC 450 

GTACTCGTGA GGGTAATGAC TTCTACCACG AAATGACCGA CTCCAACGTT 500 

ATCGATAAAG TATCCCTGGT GTACGGCCAG ATGAACGAGC CGCCGGGAAA 550 

CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAATTCCGTG 600 

ACGAAGGTCG TGACGTACTG CTGTTCGTCG ATAACATCTA TCGTTACACC 650 

CTGGCCGGTA CTGAAGTATC CGCACTGCTG GGTCGTATGC CTTCAGCGGT 700 

AGGTTATCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 750 

TCACCTCCAC CAAAACCGGT TCTATCACCT CCGTA 78 5 



2) INFORMATION FOR SEQ ID NO: 318 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 759 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318 

CTGGTGCTGG AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TACGTWCCAT 50 

CGCTATGGGT TCTTCCGACG GTCTGCGTCG CGGTCTGGAT GTTAAAGATC 100 

TCGAGCACCC AATCGAAGTT CCGGTMGGTA AAGCAACMCT GGGTCGTATC 150 
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ATGAACGTAC TGGGTCAKCC AGTMGACATG AAAGGCGACA TCGGTGAASa "2 00* 

AGAGCGTTGG GCTATCCACC GCGCTGCACC TTCCTACGAA GAGCTGTCTA 2 50 

GCTCTCAGGA ATTGCTGGAA ACCGGTATCA AAGTTATCGA CCTGATGTGT 300 

CCGTTCGCTA AGGGCGGTAA AGTCGGTCTG TTCGGTGGTG CSGGTGTTGG 350 

5 TAAAAC CGTA AACATGATGG AGCTGATCCG TAACATCGCG ATCGAGCACT 400 

CCGGTTACTC CGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGTAAC 4 50 

GACTTCTACC ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT 500 

GGTATATGGC CAGATGAACG AGCCACCGGG AAACCGTCTG CGCGTTGCTC 550 

TGACCGGTCT GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTA 60 0 

LO CTGCTGTTCG TCGATAACAT CTATCGTTAC ACCCTGGCCG GTACTGAAGT 650 

ATCTGCWCTG CTGGGTCGTA TGCCTTCAGC GGTAGGTTAC CAGCCGACCC 700 

TGGCGGAAGA GATGGGCGTT CTGCAGGAAC GTATCACCTC CACCAAGACC 750 

GGTTCTATC 759 



L5 



10 



2) INFORMATION FOR SEQ ID NO: 319 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 

(B) STRAIN: ATCC 33435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 



TTCCCTCAGG ATGCCGTACC GCGTGTGTAC GAAGCCCTTG AGGTTCAGAA 50 

TGGTAATGAA GTGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

35 TCGTACGTAC CATCGCTATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

ACTGGGTCGT ATCATGAACG TACTGGGCCA ACCGGTAGAC ATGAAAGGCG 250 

ACATCGGTGA AGAAGAACGT TGGGCTATCC ACCGTGCAGC ACCTTCCTAC 3 00 

GAAGAGCTGT CAAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

10 CGACCTGATG TGTCCGTTTG CGAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC 4 50 

GCGATTGAGC ACTCCGGTTA TTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTTTC CCTGGTTTAC GGCCAGATGA ACGAGCCACC AGGAAACCGT 600 

15 CTGCGCGTTG CGCTGACTGG TCTGACTATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGCGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACAGA AGTATCTGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 

50 



2) INFORMATION FOR SEQ ID NO: 320 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 

GCCGTACCGC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTCGG TGGCGGTATC GTGCGTACCA 100 

TCGCCATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT TCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA CTGGGTCACC CGGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCTGCGC CTTCCTACGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGCGGT GCGGGTGTTG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 50 0 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 70 0 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCGACT 7 50 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 321 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 4356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 

TCGATAAGAA TTTACCTGAT ATTAACAACG CCTTACGTGT AATCAAGTCC 50 

GAAGATGAAA GCATCGTTCT TGAAGTTACA CTTGAACfCG GTGATGGTGT 100 

TTTAAGAACA ATCGCCATGG AATCTACCGA TGGTCTTCGT CGTGGTATGA 150 

AAGTCGAAGA TACTGGCGCT CCAATTTCAG TTC CAGTTGG AGAAGACACT 2 00 

TTAGGTCGTG TGTTTAACGT TTTAGGACAG CCTATTGATG GTGGTCCAGC 250 

CTTTCCAAAG GATCACCCAC GTGAGGGTAT CCACAAGGAA GCACCTAAAT 3 00 

ATGAAGATTT AACTACTAGT CGTGAAATTC TTGAAACTGG TATCAAGGTT 350 

ATCGACCTTC TTGAACCATA TGTTCGTGGT GGTAAAGTTG GTTTGTTTGG 4 00 

TGGTGCCGGT GTTGGTAAAA CTACTATTAT TCAAGAATTA ATTCACAACA 450 

TCGCTCAAGA ACACGGTGGT ATTTCCGTAT TTACTGGTGT TGGTGAAAGA 500 

ACTCGTGAAG GTAATGACCT TTACTTTGAA ATGAAAGCTT CAGGCGTTTT 550 

AAGTAAGACT GCCATGGTAT TTGGTCAGAT GAACGAGCCG CCTGGTGCCA 600 

GAATGCGTGT TGCATTAACC GGTTTGACAC TTGCTGAATA CTTTAGAGAT 650 

GTTGAAGGTC AAGACGTATT GCTCTTTATT GACAATATCT TTAGATTTAC 700 

TCAGGCTGGT TCAGAGGTAT CTGCTTTGCT TGGTCGTATG CCAAGTGCCG 750 

TAGGTTATCA GCCAACTTTG GCAACAGAAA TGGGTCAATT GCAGGAAAGA 800 
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5 2) INFORMATION FOR SEQ ID NO: 322 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella, pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 



TTCCTCGTGA TAGCGTGCCT AAAGT CAATG ATGCGTTAAA GCTTGTTGAT 50 

AGTGATCTGG TTTTTGAAGT GCAGCAGCAA CTTGGAGACG GAGTTGTGCG 100 

TACTATTGCC ATGGGAACAA CCGATGGTTT AAAGCGAGGA TTAAAAGCAG 150 

AAAATACAGG CCATCCTATT CAAGTGCCAG TAGGTAAGAA AACTTTGGGA 2 00 

2 5 CGCATTATGG ATGTTCTTGG GCGTCCTGTA GATGATGCTG GGCCTATCGA 250 

TGCTGAAGAG ACTTGGGCTA TTCATCGTAA AGCACCAAGT TATGAAGAGC 3 00 

AAGCTGGCAG CCAGGAATTA TTGGAAACTG GTATTAAAGT AATTGATTTG 3 50 

CTTTGCCCTT TTGCCAAGGG AGGTAAAGTT GGTCTATTCG GTGGTGCCGG 40 0 

TGTAGGCAAA ACCGTTAACA TGATGGAATT AATACGAAAC ATTGCAATTG 450 

3 0 AGCATAGCGG TTATTCAGTG TTTGCAGGGG TTGGTGAACG TACCCGTGAA 50 0 

GGAAACGACT TCTATCATGA GATGAAAGAC TCTAATGTAT TGGATAAAGT 550 

ATCGCTTGTT TATGGTCAGA TGAATGAGCC GCCAGGAAAC CGTTTGCGTG 600 

TTGCTCTAAC CGGTTTGACT ATGGCTGAAA AATTCCGGGA TGAAGGGCGA 650 

GACGTTCTTT TGTTTATCGA TAATATTTAT CGTTATACCT TGGCTGGGGT 700 

35 TGAAGTATCT GCGCTGTTAG GCCGTATGCC TTCTGCAGTA GGATATCAGC 750 

CGACATTAGC AGAGGAAATG GGTATGCTGC AAGAGCGCAT TACCTCCACA 800 

AAAACAGGTT CTATTACTTC CATA 824 



40 



50 



2) INFORMATION FOR SEQ ID NO: 323 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 818 bases 
4 5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33 999 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 

GACGCCGTAC CGAAAGTGTA CGATGCGCTT GAAGTTCAAA TTGATGCCAA 50 

GCTGGTTCTG GAAGTTCAAC AGCAGCTCGG CGGCGGCGTT GTTCGCTGCA 100 

TCGCGATGGG TACTTCAGAC GGCTTAAGCC GCGGTCTGGA CGTGCTCGAT 150 

60 CTGGAACACC CGATTGAAGT ACCGGTGGGC AAAGCGACGC TGGGCCGCAT 200 
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CATGAACGTG CTTGGTCACC CTATCGACAT GAAGGGCGAC ATCGGCGAAG ' 2*50" 

AAGAGCGTTG GGCTATTCAC CGCGCAGCGC CGAGCTACGA AGACCTGTCG 3 00 

GGCGCAACCG AGCTGCTGGA GACCGGCATC AAGGTTATCG ACCTGATTTG 350 

TCCGTTCGCC AAGGGCGGTA AAGTCGGCCT GTTCGGCGGC GCCGGCGTAG 4 00 

5 GTAAAACCGT AAACATGATG GAGCTCATTC GCAACATTGC GACCGAGCAC 450 

TCCGGTTACT CCGTGTTTGC AGGCGTAGGT GAACGTACCC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGAATCCAA CGTATTGGAC AAGGTGTCGC 550 

TGGTATACGG TCAGATGAAC GAGCCGCCTG GAAACCGTCT GCGCGTAGCG 600 

TTAACGGGCT TGACCATGGC GGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

10 TCTGCTGTTT ATCGACAACA TTTACCGCTA TACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CCGTAGGCTA CCAGCCGACT 750 

CTGGCTGAGG AAATGGGCGT GCTTCAAGAG CGTATTACCT CTACCAAGAC 800 

GGGGTCTATC ACCTCCGT 818 



15 



50 



2) INFORMATION FOR SEQ ID NO: 324 



(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria, monocytogenes 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 

GTGGAAACTT ACCTGAAATC TACAATGCCC TAGTTATTGA ATATAAATCT 50 

GATGCAGAAG AAGCACCAAC TAGCCAACTT ACTTTAGAAG TAGCCATCCA 100 

ATTAGGTGAT GATGTTGTAC GTACAATCGC AATGGCATCA ACAGATGGTG 150 

35 TTCAAAGAGG TATGGAAGTT ATTGATACTG GGAGCCCAAT TACAGTTCCT 200 

GTAGGTACAG TAACTCTTGG TCGTGTATTT AATGTATTAG GAAACACCAT 250 

CGATTTGGAC GAACCACTTC CAAGCGATAT TAAACGTAAT AAAATTCACC 3 00 

GCGAAGCTCC AACTTTCGAT CAATTAGCAA CGACAACAGA AATTCTTGAA 3 50 

ACAGGAATTA AAGTAGTTGA CTTGCTAGCT CCTTACTTAA AAGGTGGTAA 4 00 

40 AATCGGATTG TTCGGTGGTG CCGGTGTTGG TAAAACCGTT CTAATTCAAG 4 50 

AGCTTATCCA TAATATCGCA CAAGAACATG GTGGTATTTC TGTGTTCGCT 500 

GGCGTTGGAG AACGTACTCG TGAAGGTAAC GACCTTTACT TCGAAATGAA 550 

AGATTCAGGC GTTATTGAAA AAACAGCGAT GGTATTCGGT CAAATGAACG 6 00 

AGCCACCAGG TGCGCGTATG CGTGTTGCCT TAACTGGTCT AACAATTGCT 650 

45 GAATATTTCC GTGATGAAGA ACATCAAGAT GTACTTTTAT TCATTGATAA 7 00 

CATTTTCCGT TTCACACAAG CTGGTTCAGA GGTTTCGGCT TTACTAGGTC 750 

GTATGCCATC TGCGGTAGGT TACCAACCAA CCCTAGCTAC TGAAATGGGT 800 

CAACTACAAG AACGTATTAC ATCTACTAAC GTTGG 835 



2) INFORMATION FOR SEQ ID NO: 325 



(i) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus lylae 

(B) STRAIN: ATCC 27566 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 

CCCGCGTGGC GAGTTGCCGG CACTGTTCAA CGCGCTGACT GTCGAGGTCA 50 

CCCTCGAAGC AGTCGCTAAA ACCATTACCC TTGAGGTTGC TCAGCACCTC 100 

10 GGCGACAACT TGGTTCGTGC CGTGTCCATG GCACCGACCG ACGGTCTCGT 150 

CCGTGGCGCT GCTGTGATCG ACAGCGGTAA GCCGATCTCA GTTCCCGTTG 2 00 

GTGACGTAGT CAAGGGACAC GTCTTCAACG CTCTGGGTGA TTGCCTCGAT 2 50 

GAGCCAGGTC TTGGC CGTG A CGGTGAGCAG TGGGGCATCC ACCGCGATCC 300 

GCCACCTTTT GACCAGCTTG AGGGTAAGAC CGAGATTCTG GAAACCGGTA 35 0 

15 TTAAGGTCAT CGACCTGCTG ACC CCGTATG TTAAGGGCGG CAAGATCGGC 40 0 

CTGTTCGGTG GTGCTGGTGT GGGTAAGACC GTTCTTATCC AGGAAATGAT 450 

CACCCGTATC GCTCGCGAGT TCTCCGGTAC CTCGGTGTTC GCAGGCGTGG 50 0 
GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTGGAAAT GGAAGAGATG , 550 

GGCGTTCTCC AGGACACCGC TCTTGTGTTC GGCCAGATGG ACGAGCCTCC 600 

2 0 AGGAGTTCGT ATGCGCGTGG CGCTGTCCGG CCTGACCATG GCGGAGTACT 650 

TCCGCGATGT GCAGCACCAG GACGTGCTTC TGTTCATCGA CAACATCTTC 7 00 

CGTTTCACCC AGGCAGGTTC CGAGGTTTCC ACCCTCCTAG GCCGCATGCC 750 

TTCTGCCGTG GGTTACCAGC CAACGCTGGC AGACGAGATG GGTGTTCTGC 800 

AGGAGCGTAT TACCTCCACA AAGGGTAA 82 8 

25 



2) INFORMATION FOR SEQ ID NO: 32 6 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella. wisconsensis 
£0 (B) STRAIN: ATCC 35 017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 

GATGCCGTAC CAAAAGTGTA CGATGCTCTT GAGGTTCTTA ACGGTAAAGA 50 

i5 AAAATTGGTG CTGGAAGTTC AGCAACAATT AGGCGGTGGT GTTGTTCGTT 100 

GTATCGCAAT GGGTACATCA GATGGTTTAA GCCGCGGTTT AGAAGTTAAA 150 

AATACAGATC ATCCGATCGA AGTTCCTGTC GGTGTTAAAA CGCTTGGCCG 200 

TATCATGAAC GTGCTGGGTG ACCCAATCGA CATGAAAGGT GATATCGGCG 250 

AAGAAGAACG CTGGTCAATT CACCGCGCAG CACCAAGCTA TGAAGATCTG 300 

50 GCTAACTCAA CAGAACTTCT AGAAACAGGT ATCAAAGTTA TGGACCTGAT 350 

TTGCCCATTC GCTAAAGGGG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 4 00 

TCGGTAAAAC AGTTAACATG ATGGAGCTTA TTCGTAATAT CGCGATTGAG 450 

CACTCAGGTT ATTCTGTATT CGCGGGTGTT GGTGAACGTA CTCGTGAAGG 500 

TAACGATTTC TACCATGAAA TGACAGACTC AAACGTTCTG GATAAAGTTT 550 

55 CATTGGTTTA TGGCCAGATG AATGAGCCAC CAGGAAACCG TCTGCGTGTT 600 

GCTCTGACTG GTCTGACTAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGTTA TTCGTAGATA ATATTTATCG TTATACCTTA GCAGGGACAG 700 

AAGTATCTGC ACTGCTGGGT CGTATGCCTT CAGCGGTGGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCGACTAA 800 

50 GACCGGCTCT ATCACTTCCG TA 822 
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2) INFORMATION FOR SEQ ID NO: 327 

5 

'( i ) SEQUENCE CHARACTERI ST ICS : 

(A) LENGTH: 854 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Branhamella catarrhal is 

(B) STRAIN: ATCC 43628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 



20 CCGTGGCGAT 
AAACCACCCT 
ATTGCCATGG 
TTCAGGTGCA 
TTATGGATGT 

25 GAACAAAAAT 
AAATAGTACA 
GTCCATTTGC 
GGTAAGACCG 
CTCAGGTCTG 

30 ATGACTTCTA 
TTTACTCAGT 
AGGAAACCGT 
TCCGTGATGA 
TTCGTTGATA 

35 ACTTTTAGGT 
AAGAGATGGG 
ATTA 



GTCCCCCAAA 
TGAAGTCCAA 
GTTCTACCGA 
CCCATTTCGG 
CCTAGGTCGC 
GGTCCATTCA 
GAACTTTTAG 
CAAAGGTGGT 
TTAACATGAT 
TCGGTTTTTG 
CCATGAAATG 
CAAAAGTTGC 
CTGCGTGTTG 
AAAAGACGAA 
ATATTTATCG 
CGTATGCCAT 
CTTGCTACAA 



TCTTTGATGC 
CAACAGTTAG 
AGGCTTAAAG 
TAC CAGTCGG 
CCAATCGATG 
TCGTGAAGCA 
AAACAGGCAT 
AAAGTCGGTC 
GGAGCTTATC 
CTGGTGTGGG 
CAAGAAGCAG 
CATGGTTTAT 
CCTTAACTGG 
GCAACGGGCA 
TTACACATTG 
CTGCGGTAGG 
GAGCGTATCA 



ACTTCATGTT 
GTGATGGTGT 
CGTGGCTTGC 
TCAAGCAACA 
AAGCAGGTCC 
CCAAGTTATG 
CAAAGTGATT 
TGTTCGGTGG 
AATAATATCG 
TGAGCGTACT 
GCGTTGTTAA 
GGTCAGATGA 
TTTGACCATG 
AAGGCCGTGA 
GCAGGTACTG 
TTATCAGCCG 
CCTCCACCCA 



GATGGTACTG 50 

GGTGCGTACC 100 

CTGTCTCTAA 150 

CTGGGTCGCA 2 00 

GGTAAATGCT 250 

ATGAACAGTC 300 

GATTTGCTTT 350 

TGCTGGTGTT 400 

CCCTAAAACA 450 

CGTGAGGGTA 500 

TACCGAAGAT 550 

ATGAGCCACC 600 

GCAGAGTATT 650 

TGTTCTGCTG 700 

AGGTATCAGC 750 

ACTTTGGCCG 800 

ATCAGGCTCA 850 

854 



40 



2) INFORMATION FOR SEQ ID NO : 328 



( i ) SEQUENCE CHARACTERI ST I CS : 
(A) LENGTH: 831 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moraxella osloensis 

(B) STRAIN: ATCC 19976 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 

CCGTCAAAGC GTACCAAGAA TTTATGATGC CTTAAAAGTT GAAGGCACAG 50 

AAACTACATT AGAAGTACAA CAACAATTGG GTGATGGTAT CGTACGTACT 100 

ATTGCCATGG GTTCTACTGA AGGTCTAAAA CGTGGTCTAC CAGTTAGCAA 150 

50 CACTGGCGCA CCAATCTCTG TACCTGTGGG TAAAGGTACA CTAGGTCGTA 200 
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TCATGGACGT TTTAGGACAC CCAATCGATG AGGCAGGTCC GGTAGAGC^T 35C * 

AGTAACACTT GGGCGATTCA CCGTGAAGCG CCAAGCTATG ATGAACAATC 300 

AAACTCTACT GAACTTTTAG AAACCGGTAT TAAAGTAATT GACTTACTAT 350 

GCCCATTTGC TAAAGGTGGT AAAGTCGGTC TGTTCGGTGG CGCGGGTGTT 400 

5 GGTAAAACCG TTAACATGAT GGAACTTATC AATAACATCG CAAAAGCACA 450 

CTCAGGTTTA TCGGTATTTG CTGGTGTAGG TGAGCGTACT CGTGAAGGTA 500 

ATGACTTCTA CCACGAGATG AAAGACTCAA ACGTACTTGA TAAAGTTGCG 550 

ATGGTGTATG GTCAGATGAA TGAGCCACCA GGAAACCGTT TACGTGTTGC 600 

CCTGACAGGT TTAACCATGG CAGAATACTT CCGTGACGAA AAAGATGAAA 650 

10 ACGGTAAAGG TCGTGACGTA TTATTGTTCG TTGACAATAT TTATCGTTAC 700 

ACGCTAGCGG GTACCGAAGT ATCAGCATTA TTAGGTCGTA TGCCATCTGC 750 

AGTAGGGTAT CAGCCAACGC TTGCAGAAGA GATGGGTGTA CTACAAGAAC 800 

GTATTACTTC AACCCAATCA GGCTCTATTA C 831 



15 



30 



2 ) INFORMATION FOR SEQ ID NO: 329 



{ i ) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp. morganii 

(B) STRAIN: ATCC 25830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 32 9 



CGAATTTCCT CAGGATGCAG TACCGAAAGT GTACGATGCG CTTGAGGTAA 50 

CAAATGGTAA AGAAAAACTG GTGCTGGAAG TTCAGCAGCA GTTAGGCGGC 100 

35 GGGGTTGTCC GTTGTATCGC TATGGGTACA TCTGATGGTC TGAGCCGTAA 150 

TCTGGAAGTA ACCGATTTAG GCCACCCGAT CGAAGTCCCT GTCGGCGTGA 200 

AAACCTTAGG ACGTATCATG AACGTTCTGG GTGATCCGAT CGATATGAAA 250 

GGTGACATCG GCGCAGAAGA AAAATGGTCT ATTCACCGTG CTGCACCAAC 300 

ATACGAAGAA CTGTCTAACT CCCAGGAACT GCTGGAAACA GGTATCAAAG 350 

40 TAATGGACCT GATCTGCCCG TTCGCGAAGG GTGGTAAAGT CGGTCTGTTC 4 00 

GGTGGTGCGG GTGTGGGTAA AACCGTAAAC ATGATGGAAC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCTGT ATTCGCAGGG GTCGGTGAGC 500 

GTACCCGTGA AGGTAACGAC TTCTATCATG AAATGACAGA CTCCAACGTT 550 

CTGGACAAAG TATCACTCGT GTACGGCCAG ATGAACGAGC CACCGGGAAA 600 

J 5 CCGTCTGCGC GTTGCTCTGA CCGGTCTGAC CATGGCGGAA AAATTCCGTG 650 

ATGAAGGCCG CGATGTACTG CTGTTCGTTG ATAACATCTA CCGTTATACC 700 

CTGGCCGGTA CTGAAGTATC CGCGCTGTTA GGCCGTATGC CTTCAGCGGT 750 

AGGTTACCAG CCGACACTGG CGGAAGAAAT GGGTGTGCTT CAGGAACGTA 800 

TCACATCGAC CAAAACAGGC TCTATCACGT CTGTA 835 

50 



2) INFORMATION FOR SEQ ID NO: 330 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 
5 (B) STRAIN: ATCC 27155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 

GACGCGGTAC CGCAAGTGTA CAGCGCCCTC GAGGTTATGA ATGGTGATGC 50 

10 GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGC GTAGTACGTA 100 

CCATCGCAAT GGGTACGTCT GACGGCCTGA AGCGTGGTCT GAGCGTCAAC 150 

GACCTGCAGA AACCGATTCA GGTACCCGTC GGTAAAGCGA CCCTGGGCCG 2 00 

TATCATGAAC GTTCTCGGCG AGCCAATCGA TATGAAAGGC GAGCTGAAAG 2 50 

AAGAAGATGG CAGCGCAGTA GAGATCGCCT CTATTCACCG CGCAGCCCCT 3 00 

15 TCTTATGAAG ATCAGTCTAA CTCGCAGGAA CTGCTGGAAA CCGGCATCAA 3 50 

GGTTAT CGAC CTGATGTGTC CGTTTGCTAA AGGCGGTAAA GTCGGTCTGT 4 00 

TCGGTGGTGC GGGTGTAGGT AAAACCGTCA ACATGATGGA ACTGATCCGT 450 

AACATCGCGG CTGAACACTC AGGTTACTCA GTGTTTGCCG GTGTGGGTGA 500 

GCGTACTCGT GAGGGTAACG ACTTCTACCA CGAAATGACT GACTCTAACG 550 

2 0 TTATCGATAA AGTTGCACTG GTCTATGGCC AGATGAACGA GCCGCCGGGT 600 

AACCGTCTGC GCGTAGCACT GACCGGTCTG ACCATGGCGG AAAAATTCCG 650 

TGATGAAGGT CGCGACGTTC TGCTGTTCAT CGATAACATC TAC CGTTATA 700 

CCCTGGCCGG TACAGAAGTT TCTGCACTGC TGGGTCGTAT GCCATCTGCG 750 

GTAGGTTACC AGCCAACGCT GGCAGAAGAG ATGGGTGTGT TGCAGGAGCG 800 

25 TATTACCTCC ACCAAGACCG GTTC 824 



2) INFORMATION FOR SEQ ID NO: 331 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
£0 (A) ORGANISM: Pantoea dispersa 

(B) STRAIN: ATCC 1458 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 

4 5 TATACAGCGC TCTCGAGGTA AAAAATGGTG ATGCTCGTCT GGTGCTGGAA 50 

GTACAGCAGC AGCTGGGCGG TGGCGTGGTG CGTACCATCG CCATGGGTTC 100 

TTCTGACGGC CTGAAGCGCG GTCTGGAAGT CACCGACCTG AAAAAACCTA 150 

TCCAGGTTCC GGTTGGTAAA GCAACACTCG GCCGTATCAT GAACGTGCTG 200 

GGTGAGCCAA TCGACATGAA AGGCGACCTG AAAGAAGAAG ACGGCAGCGC 250 

50 TGTAGAGGTT TCCTCTATTC ATCGCGCAGC GCCTTCTTAT GAAGATCAGT 300 

CAAACTCGCA GGAACTGCTG GAAACCGGCA TCAAGGTTAT CGACCTGATG 350 

TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGGCTGAGC 450 

ACTCAGGTTA TTCGGTCTTT GCCGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

55 AACGACTTCT ACCACGAAAT GACGGACTCC AACGTTATCG ATAAAGTAGC 550 

GCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGTAACCGT CTGCGCGTAG 600 

CACTGACCGG TCTGACCATG ' GCGGAAAAAT TCCGTGATGA AGGCCGTGAC 650 

GTTCTGCTGT TCATCGACAA CATCTACCGT TACACCCTGG CCGGTACAGA 700 

GGTTTCTGCA CTGCTGGGTC GTATGCCATC GGCGGTAGGT TATCAGCCAA 750 

50 CGCTGGCTGA AGAGATGGGT GTGCTGCAGG AGCGTATTAC CTCCACCAAG 800 

198 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
ACCGGTTC 



PCT/CA00/01150 



5 2) INFORMATION FOR SEQ ID NO: 332 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
LO (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida 

(B) STRAIN: NCTC 10322 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 



GATGCAGTAC CAAAAGTATA TGATGCCTTA AATGTTGAAA CAGGTTTAGT 50 

ACTTGAAGTT CAACAACAAT TAGGTGGTGG TGTAGTTCGC TGTATCGCAA 100 

TGGGATCATC TGATGGATTA AAACGCGGTT TAAGCGTAAC AAATACGAAT 150 

AACCCAATTT CTGTTCCAGT GGGAACGAAA ACATTGGGTC GTATCATGAA 2 00 

25 CGTATTGGGT GAACCAATCG ATGAGCAAGG TGAAATCGGT GCAGAAGAGA 250 

ATTGGTCTAT TCACCGTGCG CCACCAAGTT ATGAAGAACA ATCTAACAGT 3 00 

ACTGAACTTT TAGAAACGGG AATTAAAGTT ATCGACTTAG TTTGTCCGTT 3 50 

TGCGAAAGGG GGTAAAGTAG GTTTATTCGG TGGTGCGGGT GTCGGTAAAA 400 

CCGTCAATAT GATGGAATTA ATC CGTAAC A TCGCAATTGA GCACTCAGGT 450 

30 TACTCTGTCT TTGCGGGGGT AGGTGAGCGT ACGCGTGAAG GTAACGACTT 500 

CTATCATGAG ATGAAAGACT CTAACGTATT AGATAAAGTG TCTCTTGTTT 550 

ATGGTCAAAT GAACGAGCCA CCAGGTAACC GTTTACGTGT GGCATTAACA 60 0 

GGCTTAACTA TGGCGGAAAA ATTCCGTGAT GAAGGTCGTG ATGTCTTATT 65 0 

CTTCGTTGAT AATATTTATC GTTATACTCT TGCTGGTACA GAAGTTTCTG 70 0 

35 CATTATTAGG TCGTATGCCA TCTGCGGTAG GTTATCAACC AACCCTTGCA 75 0 

GAAGAAATGG GTGTTCTGCA AGAGCGTATT ACCTCAACCA AAACAGGTTC 800 

TATTA 805 



10 



50 



2) INFORMATION FOR SEQ ID NO: 3 33 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 82 8 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontlum 

(B) STRAIN: ATCC 49100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 

TTTCCTCAAG ACGCCGTACC AAAAGTGTAC GACGCGCTTG AAGTTCAGAA 50 

CGATGCCAAG CTGGTGCTGG AAGTTCAACA ACAGCTCGGT GGTGGTGTCG 100 

TTCGTTGTAT CGCAATGGGT ACTTCCGATG GCTTAAGCCG CGGTTTAAAA 150 

GTGCTTGATT TAGAACATCC TATCGAAGTA CCGGTTGGTA CTGCGACGCT 200 

199 
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GGGCCGTATT ATGAACGTGC TCGGTCAGCC AATCGATATG AAAGGCGATA 250 

TTGGTGAAGA AGAGCGTTGG GCTATTCACC GTGAAGCACC AAGTTATGAA 3 00 

GATTTATCTG GCGCCAATGA ACTGCTGGAA ACGGGTATCA AGGTTATCGA 350 

CCTGATTTGT CCGTTTGCTA AAGGTGGTAA AGTTGGTCTG TTTGGTGGTG 4 00 

5 CGGGTGTAGG TAAAACCGTA AACATGATGG AGCTGATTCG TAACATTGCG 450 

ACTGAGCACT CAGGTTACTC CGTATTCGCC GGTGTAGGGG AACGTACCCG 500 

TGAAGGTAAT GACTTCTACC ACGAAATGAC CGAATCAAAC GTACTGGATA 550 

AAGTATCTCT GGTTTATGGC CAGATGAACG AGCCACCAGG AAACCGTCTG 60 0 

CGCGTGGCGT TAACGGGTTT GACCATGGCT GAAAAATTCC GTGATGAAGG 650 

LO TCGTGACGTT CTGTTATTTA TCGATAACAT TTATCGCTAT ACCTTAGCCG 700 

GTACCGAAGT ATCAGCACTG TTGGGGCGTA TGCCATCAGC GGTAGGTTAT 750 

CAGCCAACGT TAGCAGAAGA GATGGGTGTG TTGCAGGAAC GTATTACTTC 8 00 

AACCAAAACC GGTTCAATCA CTTCTGTA 82 8 



L5 
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2) INFORMATION FOR SEQ ID NO: 334 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334 



GTCCCTAAAG TATACGACGC TCTTGAGGTT ATGAATGGTA AAGAAAAACT 50 

GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTATCGTT CGTTGTATCG 100 

35 CAATGGGTAC ATCAGACGGT TTAAG CCGTG GCTTAAAGGT TGAAGATTTA 150 

GGCCACCCAA TTGAAGTACC AGTAGGTAAA G CGACTTT AG GACGTATCAT 200 

GAACGTTCTG GGTACACCTA TTGATATGAA AGGTGAGATT GAAAC CGAAG 250 

AGCGTTGGTC AATCCACCGT GAAGCACCAA CTTACGAAGA GTTATCAAAC 300 

TCTCAAGAAC TGCTTGAAAC CGGTATCAAA GTTATGGACT TAATCTGTCC 350 

10 ATTTGCTAAA GGTGGTAAAG TCGGTCTGTT CGGTGGTGCG GGTGTTGGTA 400 

AAACAGTTAA CATGATGGAA TTGATCCGTA ATATCGCGAT CGAGCACTCA 450 

GGTTACTCTG TATTTGCTGG TGTTGGTGAG CGTACTCGTG AGGGTAACGA 500 

CTTCTATCAT GAAATGACAG ATTCTAACGT TCTTGACAAA GTATCGTTAG 550 

TTTACGGTCA GATGAATGAG CCAC CAGGAA ACCGTCTGCG TGTTGCACTG 600 

15 ACTGGTCTGA CTATGGCTGA GAAATTCCGT GATGAAGGCC GTGACGTACT 650 

GTTATTCGTC GATAAC AT CT ATCGTTACAC CTTAGCCGGT ACAGAAGTAT 700 

CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA ACCAACATTG 750 

GCTGAAGAGA TGGGTGTTCT GCAAGAGCGT ATCACTTCAA CCAAAACAGG 800 

TTCTATC 8 07 

50 



2) INFORMATION FOR SEQ ID NO : 335 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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( ii ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 
5 (B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335 

CCCTAAAGTA TACGACGCTC TTGAGGTTAT GAATGGTAAA GAGAAACTGG 50 

LO TGCTAGAAGT TCAGCAACAG TTAGGCGGTG GTATCGTTCG TTGTATCGCA 100 

ATGGGTACAT CAGACGGTTT AAGCCGTGGC TTAAAAGTTG AAAACTTAGG 150 

CCACCCAATT GAAGTACCAG TAGGTAAAGC AACACTGGGA CGTATCATGA 2 00 

ACGTTCTGGG TACACCTATC GATATGAAAG GTGATATTGC AACTGAAGAA 2 50 

CGTTGGTCTA TTCACCGCGA AGCGCCAACC TATGAAGAGT TATCAAGCTC 3 00 

15 TCAAGAACTA CTAGAAACCG GTATCAAAGT AATGGACTTA ATCTGTCCGT 3 50 

TTGCTAAAGG TGGTAAAGTA GGTCTCTTCG GTGGTGCGGG TGTTGGTAAA 40 0 

ACAGTTAACA TGATGGAATT GATCCGTAAC ATCGCGATTG AGCACTCAGG 450 

TTATTCTGTA TTTGCAGGTG TTGGTGAGCG TACTCGTGAG GGTAACGACT 500 

TCTATCATGA AATGACAGAT TCTAACGTTC TTGACAAAGT ATCGTTAGTT 550 

2 0 TATGGTCAGA TGAATGAGCC ACCAGGAAAC CGTCTACGTG TAGCACTGAC 60 0 

GGGTTTAACC ATGGCGGAAA AATTCCGTGA TGAAGGCCGT GACGTACTGT 650 

TATTCGTCGA TAACATCTAT CGTTACACCT TAGCCGGTAC CGAAGTATCA 700 

GCACTGTTAG GCCGTATGCC ATCAGCAGTA GGTTACCAAC CAACATTGGC 750 

TGAAGAGATG GGTGTTCTGC AAGAACGTAT CACTTCAACC AAAACAGGTT 8 00 

25 CAATCACCTC T 811 



30 



2) INFORMATION FOR SEQ ID NO: 33 6 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
*0 (A) ORGANISM: Providencia alcalifaciens 

(B) STRAIN: ATCC 98 86 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 36 

45 TCAAGATAAC GTACCAAAAG TGTACGATGC TCTTGAGGTT ATTAACGGTA 50 

AAGAAAAACT GGTGTTGGAA GTTCAACAAC AGTTAGGTGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGT CTGAGCCGTG GTTTAGAAGT 150 

TGTAAACTTA GAGCACCCAA TCGAAGTACC AGTCGGTAAA GCAACTCTGG 2 00 

GACGTATCAT GAACGTTCTG GGTGAACCAA TCGACATGAA AGGTGATATC 250 

50 GGCGAAGAAG AGCGCTGGTC TATTCACCGT GCTGCACCAA GCTACGAAGA 3 00 

ATTAGCTAAC TCAACTGAAC TGCTGGAAAC CGGTATCAAA GTAATGGACT 350 

TAATCTGTCC ATTCGCGAAA GGTGGTAAAG TAGGTCTGTT CGGTGGTGCG 4 00 

GGTGTTGGTA AAACCGTAAA CATGATGGAA CTGATCCGTA ACATCGCGAT 4 50 

TGAGCACTCA GGTTACTCAG TGTTCGCTGG TGTTGGTGAG CGTACCCGTG 500 

55 AAGGTAACGA CTTCTATCAT GAAATGACAG ACTCAAACGT TCTGGATAAA 550 

GTATCACTGG TTTATGGCCA GATGAACGAG CCACCAGGAA ACCGTCTGCG 600 

TGTTGCGCTG ACTGGTCTGA CTATGGCTGA AAAATTCCGT GACGAAGGTC 650 

GTGACGTACT GCTGTTCGTT GACAACATTT ATCGTTATAC ACTGGCAGGT 700 

ACTGAAGTAT CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA 750 

50 ACCAACGCTG GCGGAAGAGA TGGGTGTTCT TCAAGAACGT ATTACCTCAA 800 

201 
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5 2) INFORMATION FOR SEQ ID NO: 337 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 
L0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 



20 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337 



TTCCCTCAAG ATGACGTACC AAAAGTGTAC GACGCTCTTG AGGTTGTTAA 50 

CGGTAAAGAA ACACTGGTGC TGGAAGTTCA GCAACAGTTA GGCGGTGGTG 10 0 

TTGTCCGTTG TATCGCAATG GGTACATCAG ATGGCCTGAG CCGTGGTTTA 150 

GAAGTTGTAA ACTTAGAGCA CCCAATTGAA GTACCAGTAG GTAAAGCAAC 200 

25 TTTAGGACGT ATCATGAACG TTCTGGGTCA GCCTATTGAT ATGAAAGGTG 250 

ATATCGGCGA AGAAGAGCGC TGGTCAATTC ACCGTGCTGC ACCTAGCTAC 300 

GAAGAGTTAG CTAACTCAAC AGAGCTGCTG GAAACCGGTA TCAAAGTAAT 3 50 

GGACTTAATC TGTCCATTCG CGAAAGGTGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAACTGAT CCGTAACATC 4 50 

3 0 GCGATTGAGC ACTCAGGTTA CTCAGTATTC GCTGGTGTTG GTGAGCGTAC 5 00 

TCGTGAAGGG AACGACTTCT ATCATGAAAT GACTGACTCA AACGTTCTGG 550 

ATAAAGTATC ACTGGTTTAT GGCCAGATGA ATGAGCCACC AGGAAACCGT 6 00 

CTGCGCGTTG CGTTGACTGG TCTGACTATG GCTGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTACTGT TCGTTGACAA CATCTATCGT TATACACTGG 700 

35 CAGGTACTGA AGTATCAGCA CTGTTAGGTC GTATGCCTTC AGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGT GTTCTGCAAG AACGTATTAC 800 

CTCAACTCAA ACGGGTTCTA TCACTTCCGT 83 0 



2) INFORMATION FOR SEQ ID NO: 338 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 812 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 

(B) STRAIN: ATCC 33673 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338 

AGCGTACCAA AAGTGTACGA TGCTCTTGAG GTTATTAACG GTAAAGAAAA 50 

ACTGGTGTTG GAAGTTCAGC AGCAGTTAGG CGGTGGTGTT GTCCGTTGTA 100 

TCGCAATGGG TACATCAGAT GGTCTGAGCC GTGGTTTAGA AGTTGTAAAC 150 

50 TTAGAACACC CAATTGAAGT ACCAGTAGGT AAAGCAACTC TGGGACGTAT 200 

202 
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CATGAACGTT CTGGGTGACC CTATTGATAT GAAAGGTGAT ATCGGCGAAG 25 C 

AAGAGCGCTG GTCTATTCAC CGTTCAGCGC CAAGCTATGA AGAATTAGCT 3 00 

AACTCAACAG AACTGCTAGA AACCGGTATC AAAGTAATGG ACTTAATCTG 350 

TCCATTCGCG AAAGGTGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 400 

5 GTAAAACAGT AAACATGATG GAACTGATCC GTAACATCGC GATTGAGCAC 450 

TCAGGTTACT CAGTATTCGC TGGTGTTGGT GAGCGTACCC GTGAAGGTAA 500 

CGACTTCTAT CATGAAATGA CTGATTCTAA CGTTCTGGAT AAAGTATCAC 550 

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGTGTTGCG 600 

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGACGAAG GTCGTGACGT 650 

LO ACTGCTGTTC GTTGACAACA TTTATCGTTA TACACTGGCA GGTACTGAAG 70 0 

TATCAGCACT GTTAGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACA 750 

TTGGCAGAAG AGATGGGTGT TCTACAAGAA CGTATCACTT CTACCAAAAC 800 

CGGTTCTATC AC 812 



L5 
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2) INFORMATION FOR SEQ ID NO: 339 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia. stuartil 

(B) STRAIN: ATCC 33672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 



TCAAGATGCA GTACCAAAAG TGTACGATGC GCTTGAGGTT GTTAACGGTA 50 

AAGAAAAACT GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTGTTGTC 100 

35 CGTTGTATCG CAATGGGTAC ATCAGATGGC CTAAGCCGTG GTTTAGAAGT 150 

TAAAAATTTA GAACACCCAA TTGAAGTACC AGTAGGTAAA GCAACACTCG 2 00 

GACGTATCAT GAACGTTCTG GGTGACCCTA TTGATATGAA AGGTGATATC 250 

GGCGAAGAAG AGCGTTGGTC TATTCACCGC GCTGCACCAA GCTACGAAGA 300 

GCTATCGAGC TCAACTGAAC TGCTAGAGAC AGGTATCAAA GTCATGGACT 350 

10 TGATCTGTCC ATTCGCGAAA GGTGGTAAAG TTGGTCTGTT CGGTGGTGCG 400 

GGTGTTGGTA AAACGGTAAA CATGATGGAA CTTATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TATTCGCAGG TGTTGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGACAG ATT CAAACGT TCTTGACAAA 550 

GTATCACTGG TTTATGGTCA GATGAATGAG CCACCAGGAA ACCGTCTACG 600 

15 CGTAGCATTG ACTGGTTTGA CTATGGCTGA GAAATTCCGT GACGAAGGCC 650 

GTGATGTTCT GTTGTTCGTG GATAACATCT ATCGTTATAC ACTGGCAGGT 700 

ACAGAAGTAT CGGCTCTGTT AGGTCGTATG CCATCAGCAG TAGGTTATCA 750 

GCCAACATTG GCAGAAGAGA TGGGTGTTCT TCAAGAACGT ATCACTTCTA 8 00 

CTAAGACAGG TTCTATCAC 819 

50 



2 ) INFORMATION FOR SEQ ID NO: 340 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

203 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 
5 - (B) STRAIN: ATCC 2333 3- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 

AACCGCACTG ACGTGCCTCA AATTTATGAC GCGTTAGTTG TAGATGGTAC 50 

10 CGAAACCACC CTAGAAGTTC AGCAGCAGCT GGGCGATGGT GTGGTACGTA 100 

CTATTGCAAT GGGATCTACT GAAGGTCTTA AGCGTGGTTT ACCAGTAACA 150 

AACACTGGTG CCCCAATTAC AGTTCCAGTA GGTGATGCGA CTTTAGGTCG 200 

TATTATGGAC GTTTTAGGTC GTCCAATTGA CGAACAAGGT CCAGTTAATT 2 50 

CTGAAGACCA TTGGTCAATC CACCGTCAAG CGCCATCATA TGATGAGCAA 300 

15 GCTAACAGTA CTGACCTATT AGAGACAGGT ATTAAAGTAA TTGACTTACT 350 

TTGTCCGTTC GCTAAAGGGG GTAAAGTTGG TCTGTTCGGT GGTGCCGGTG 400 

TTGGTAAAAC CGTAAACATG ATGGAATTGA TTAATAACAT CGCTCTTAAG 450 

CACTCAGGTT TATCAGTATT CGCTGGTGTG GGTGAGCGTA CTCGTGAAGG 500 

TAACGACTTC TACCACGAGA TGCAAGAAGC GGGTGTTGTT GACGTTGAAA 550 

2 0 ACTTCACCAA CTCAAAAGTT GCGATGGTTT ATGGTCAGAT GAATGAGCCA 600 

CCAGGTAACC GTTTACGTGT TGCGTTAACC GGTCTGACTA TGGCTGAGTA 650 

CTTCCGTGAT CAAAAAGATG AAAACGGTAA AGGTAAAGAC GTTCTATTAT 700 

TCGTTGATAA CATCTACCGC TACACGCTAG CCGGTACTGA AGTATCAGCA 750 

CTTCTAGGTC GTATGCCATC AGCAGTAGGT TATCAGCCAA CACTAGCGGA 8 00 

25 AGAGATGGGT GTACTACAAG AGCGTATTAC TTCAACTCAG ACTGGTTCTA 850 

TTACTTC 907 



30 2 ) INFORMATION FOR SEQ ID NO; 341 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatilis 
<B) STRAIN: DSM 4 594 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 



GTTCCCTCAG GATGCAGTAC CGAACGTGTA CAATGCTCTT GAGGTAGAAA 50 

ACGGTACCTC CAAACTGGTG CTGGAAGTTC AGCAACAGTT AGGCGGCGGC 100 

GTTGTTCGTT GTATCGCAAT GGGTACCTCA GACGGCCTGC GTCGCGGTCT 150 

GAAAGTGAAC AACCTGGAAC ACCCAATTGA AGTACCGGTA GGTAAAGCGA 2 00 

50 CTCTGGGTCG TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAAGGT 250 

GAAATCGGCG AAGAAGAACG TCGTGCTATT CACCGTGCTG CGCCTTCTTA 300 

TGAAGAGCTG GCAAACTCCC AGGAATTGCT GGAAACCGGT ATCAAAGTTA 350 

TGGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT . 4 00 

GGTGCGGGTG TAGGTAAAAC TGTGAACATG ATGGAGCTGA TCCGTAACAT 450 

55 TGCGATCGAG CACTCCGGTT ATTCTGTGTT TGCAGGCGTG GGTGAACGTA 500 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACTGATTC CAACGTTATC 550 

GACAAAGTTT CCCTGGTGTA TGGCCAGATG AATGAGCCAC CAGGTAACCG 600 

TCTGCGCGTT GCACTGACCG GCCTGACCAT GGCGGAAAAA TTCCGTGATG 650 

AAGGTCGTGA CGTACTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG 700 

60 GCCGGTACCG AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG 750 

204 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

TTATCAGCCA ACGCTGGCGG AAGAGATGGG CGCTCTGCAA GAACGTATCA' 800 
CCTCGACCAA AAGTGGTTCT ATCACCTCCG TA 832 



2 ) INFORMATION FOR SEQ ID NO: 342 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 824 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. arizonae 

(B) STRAIN: ATCC 13314 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

25 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCTATTC ACCGCGCGGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TAAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

30 GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGC CAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

35 AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TAYC AS CCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCAC 824 



40 



2) INFORMATION FOR SEQ ID NO: 343 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 
55 (B) STRAIN: ATCC 7001 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343 

GATGCCGTAC CACGCGTGTA CGATGCCCTT GAGGTGCAGA ATGGTAATGA 50 
60 GAAGCTGGTG CTGGAAGTTC AGCAGCAGCT TGGCGGCGGT ATCGTGCGTA 100 

205 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CCATCGCGAT 
GATCTCGAAC 
TATCATGAAC 
AAGAAGAGCG 
5 TCAAACTCTC 
GTGTCCGTTC 
TAGGTAAAAC 
CACTCCGGTT 
TAACGACTTC 

10 CCCTGGTGTA 
GCACTGACCG 
TGTACTGCTG 
AAGTATCCGC 
ACTCTGGCGG 

15 AACCGGTTCT 



GGGGTCTTCT 
ACCCGATCGA 
GTCCTGGGCG 
TTGGGCGATT 
AGGAACTGCT 
GCGAAGGGCG 
CGTAAACATG 
ACTCAGTGTT 
TACCACGAAA 
TGGCCAGATG 
GCCTGACCAT 
TTCGTCGATA 
ACTGCTGGGC 
AAGAGATGGG 
ATCACCTCCG 



GACGGTCTGC 
AGTCCCGGTA 
AACCGGTCGA 
CACCGCGCAG 
GGAAACCGGT 
GTAAAGTCGG 
ATGGAGCTTA 
TGCGGGCGTA 
TGACCGACTC 
AACGAGCCGC 
GGCGGAGAAA 
ACATCTATCG 
CGTATGCCTT 
CGTTCTGCAG 



GTCGCGGTCT " 

GGTAAAGCCA 

CATGAAAGGC 

CACCTTCCTA 

ATCAAAGTTA 

TCTGTTCGGT 

TTCGTAACAT 

GGGGAACGTA 

CAACGTTATC 

CGGGAAACCG 

TTC CGTGACG 

TTACACCCTG 

CCGCAGTAGG 

GAACGTATCA 



GGATGTAAAA 150 

CGCTGGGTCG 200 

GAGATCGGCG 250 

CGAAGAGTTG 300 

TCGACCTGAT 3 50 

GGTGCGGGTG 400 

CGCGATCGAG 450 

CTCGTGAGGG 500 

GATAAAGTAT 550 

TCTGCGCGTT 60 0 

AAGGTCGTGA 650 

GCCGGTACGG 70 0 

TTACCAGCCG 750 

CCTCCACCAA 800 

820 



2) INFORMATION FOR SEQ ID NO: 344 



20 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Salmonella, choleraesuis subsp. diarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 

35 TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 10 0 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGTGA ACCGGTCGAT ATGAAAGGCG 250 

40 AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

45 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 60 0 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

50 TATCAGCCGA CTCTGGCTGA AGAAATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



55 2) INFORMATION FOR SEQ ID NO: 345 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

206 



SUBSTITUTE SHEET (RULE 26) 



10 



30- 



40 



WO 01/23604 PCT/CA00/01150 
(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43 974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 10 0 

TCGTACGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 20 0 

15 GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCTGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT -3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

20 GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 50 0 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGTCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 70 0 

25 CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGC CGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGT AT CAC 80 0 

CTCCACCAAG ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 346 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 9 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double- 
CD) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43974 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTCGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

50 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAT 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

55 GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 50 0 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 60 0 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAGT TCCGTGACGA 650 

60 AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 

207 



SUBSTITUTE SHEET (RULE 26} 



WO 01/23604 PCT/CA00/01150 

CCGGTACGGA AGTTTCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT * 7 St 

TATCAGCCAA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTAT CAC 8 00 

CTCCACCAAA ACCGGTTCTA TCACCTCCG 82 9 



2 ) INFORMATION FOR SEQ ID NO: 347 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp, choleraesuis 
serotype Paratyphi A 
20 (B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

25 GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCTACGC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 2 50 

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 3 00 

30 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG AC CTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CATGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

35 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCTTCCG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

40 CGGTTCTATC ACCTCCG 817 



45 



60 



2) INFORMATION FOR SEQ ID NO: 348 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Paratyphi B 
(B) STRAIN: ATCC 8759 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348 

208 



SUBSTITUTE SHEET {RULE 26) 



10 



15 



WO 01/23604 

GCCGTACCAC 
GCTGGTGCTG 
TCGCGATGGG 
CTCGAACACC 
CATGAACGTC 
AAGAGCGTTG 
AACTCTCAGG 
TCCGTTCGCG 
GTAAAACCGT 
TCCGGTTACT 
CGACTTCTAC 
TGGTGTATGG 
TTGACCGGTC 
ACTGCTGTTC 
TATCCGCACT 
CTGGCGGAAG 
CGGTTC 



GCGTGTACGA 
GAAGTTCAGC 
GTCTTCTGAC 
CGATCGAAGT 
CTGGGCGAAC 
GGCGATTCAC 
AACTGCTGGA 
AAGGGCGGTA 
AAACATGATG 
CTGTGTTTGC 
CACGAAATGA 
CCAGATGAAC 
TGACCATGGC 
GTCGATAACA 
GCTGGGCCGT 
AGATGGGCGT 



TGCCCTTGAG 
AGCAGCTTGG 
GGTCTGCGTC 
CCCGGTAGGT 
CGGTCGACAT 
CGCGCAGCGC 
AACCGGTATC 
AAGTCGGTCT 
GAGCTTATCC 
GGGCGTAGGT 
CCGACTCCAA 
GAGCCGCCGG 
GGAGAAATTC 
TCTATCGTTA 
ATGCCTTCCG 
TCTGCAGGAA 



GTGCAGAATG 
CGGCGGTATT 
GCGGTCTGGA 
AAAGCCACGC 
GAAAGGCGAG 
CTTCCTACGA 
AAAGTTATCG 
GTTCGGTGGT 
GTAACATCGC 
GAACGTACTC 
CGTTATCGAT 
GAAACCGTCT 
CGTGACGAAG 
CACCCTGGCC 
CAGTAGGTTA 
CGTATCACCT 



PCT/CA00/01150 

GTAATGAGAA £0 

GTGCGTACCA 100 

TGTAAAAGAT 150 

TGGGTCGTAT 2 00 

ATCGGCGAAG 250 

AGAGTTGTCA 3 00 

ACCTGATGTG 350 

GCGGGTGTGG 4 00 

GATCGAGCAC 450 

GTGAGGGTAA 500 

AAAGTATCCC 550 

GCGCGTTGCA 600 

GTCGTQACGT 650 

GGTACGGAAG 70 0 

CCAGCCGACT 750 

CCACCAAAAC 80 0 

806 



20 



30 



55 



2) INFORMATION FOR SEQ ID NO: 34 9 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 831 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGKGGTCTG 150 

40 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGCCGT ATCATGAACG TACTGGGCGA ACCGGTCGAC ATGAAAGGCG 2 50 

AGATTGGTGA AGAAGAGCGT TGGGCTATTC ACCGTGCTGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

45 GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

50 AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACTAAA ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 3 50 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 823 bases 

209 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhi 
10 (B) STRAIN: ATCC 10749 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

15 TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300 

2 0 GAAGAGTTAT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT AC CATGAAAT GACCGACTCC AACGTTATCG 550 

25 ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

30 CTCCACCAAG ACCGGTTCTA TCA 823 



2) INFORMATION FOR SEQ ID NO: 351 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Typhimurium 
(B) STRAIN: ATCC 14028 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TTGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 2 00 

55 GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

60 GCGATCGAGC ACTCCGGTTA CTCAGTGTTT GCGGGCGTAG GGGAACGTAC 500 

210 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TCGTGAGGGT AACGACTTCT 
ATAAAGTATC CCTGGTGTAT 
CTGCGCGTTG CATTGACCGG 
AGGTCGTGAC GTACTGCTGT 
5 CCGGTACGGA AGTATCCGCA 
TACCAGCCGA CTCTGGCGGA 
CTCCACCAAA ACCGGTTCTA 



ACCACGAAAT GACCGACTCC 
GGCCAGATGA ACGAGCCGCC 
TCTGACCATG GCGGAGAAAT 
TCGTCGATAA CATCTATCGT 
CTGCTGGGCC GTATGCCTTC 
AGAGATGGGC GTTCTGCAGG 
TCACCTCCGT A 



PCT/CA00/01150 

GGGAAACCGT 600 
TCCGTGACGA 650 
TACACCCTGG 700 
CGCAGTAGGT 750 
AACGTATCAC 800 

831 



10 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 352 



( i ) SEQUENCE CHARACTERI S TI CS : 
(A) LENGTH: 810 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 



GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

30 TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

35 TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTCTCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

40 TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC 810 

45 



2) INFORMATION FOR SEQ ID NO: 353 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 
60 (B) STRAIN: ATCC 33105 

211 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353 

GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 50 

5 GAAACTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTCGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GACGGTCTGC GTCGCGGTCT GAAAGTGAAC 150 

AACCTGGAAC ACCCGATTGA AGTGCCGGTG GGTAAAGCTA CCCTGGGCCG 20 0 

TATCATGAAC GTATTGGGCG AACCAATCGA CATGAAAGGC GAGATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGTCCTG CGCCAAGCTA CGAAGAGCTG 300 

10 TCCAACTCCC AGGAC C TGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

TTGTCCGTTC GCCAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400 

TGGGCAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCCGTGTT TGCGGGCGTG GGCGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTTCTG GACAAAGTAT 550 

15 CCCTGGTTTA CGGCCAGATG AACGAGCCGC CGGGTAACCG TCTGCGCGTT 600 

GCATTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGC CGCGA 650 

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CCGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 800 

20 GACCGGTTCC ATCACCTCCG 820 



25 



2) INFORMATION FOR SEQ ID NO: 354 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Serratia fonticola 

(B) STRAIN: ATCC 29844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 

40 GATGCCGTAC CGAAAGTGTA CAACGCTCTT GAGGTTGCAA ACGGCACCGA 50 

GAAATTGGTG CTGGAAGTTC AGCAACAGCT GGGTGGCGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCA GACGGTCTGC GTCGTGGTCT GGCCGTAACC 150 

GACC TGCAGC ACCCAATTGA AGTACCGGTA GGTAAAGCTA CCCTGGGCCG 200 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GACATCGGCG 250 

45 AAGAAGAACG TTGGGCTATT CACCGCCCTG CGCCAAGCTA CGAAGAGCTG 300 

TCCAGCTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTTGG TCTGTTCGGT GGTGCTGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

50 TAACGACTTC TACCACGAAA TGACCGATTC CAACGTACTG GACAAAGTTT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CAGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

55 ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACC 816 



60 2) INFORMATION FOR SEQ ID NO: 355 
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{ i ) SEQUENCE CHARACTERS STI CS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimesii 

(B) STRAIN: ATCC 14460 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 



15 



GATGCCGTAC CAAAAGTGTA CAACGCTCTT GAGGTAGAAA ACGGTGCCAA 50 

TAAGCTGGTG CTGGAAGTTC AGCAACAGTT GGGCGGTGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GATGGTCTGC GTCGCGGTCT GAAAGTCACA 150 

GACCTGGACC ACCCAATTGA AGTACCGGTA GGTAAAGCTA CTCTGGGCCG 200 

20 TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GATATCGGCG 250 

AAGAAGAACG TTGGGCGATT CACCGTCCGG CGCCAAGCTA CGAAGATTTG 3 00 

GCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

25 CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTACTG GACAAAGTAT 55 0 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600 

GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGCCGTGA 65 0 

CGTTCTGCTG TTCGTTGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

30 AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACCTCCG TA 822 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 356 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 819 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356 

ATGCCGTACC AAAAGTGTAC AATGCTCTTG AGGTAGAAAA CGGTACCGAG 50 

AAGCTGGTGC TGGAAGTTCA GCAGCAGTTG GGCGGTGGCG TAGTTCGCTG 100 

TATCGCGATG GGGACCTCAG ATGGTCTGCG CCGCGGTCTG AAAGTGAACG 150 

55 ATCTGGAACA CCCAATTGAA GTACCGGTAG GTAAAGCTAC CCTGGGCCGT 2 00 

ATCATGAACG TATTGGGTGA ACCAATCGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAACGT TGGGCGATTC ACCGTCCAGC GCCAAGCTAC GAAGATTTGT 3 00 

CAAACTCCCA GGATCTGCTG GAAACCGGTA TCAAGGTTAT GGACCTGATT 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCTGGTGT 4 00 

60 TGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 
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ACTCCGGTTA TTCCGTGTTT GCAGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GAACGACTCC AACGTACTGG ACAAAGTATC 550 

CCTGGTTTAC GGCCAGATGA ACGAGCCACC GGGTAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCGGAGAAAT TCCGTGACGA AGGCCGCGAC 650 

GTTCTGCTGT TCGTTGATAA CATTTACCGT TATACCCTGG CCGGTACCGA 700 

AGTGTCCGCA CTTCTGGGCC GTATGCCATC TGCGGTAGGT TATCAGCCAA 750 

CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC CTCTACCAAG 800 

ACCGGTTCTA TCACTTCCG 819 



2) INFORMATION FOR SEQ ID NO: 3 57 



( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 



TCAGGATGCC GTACCGAAAG TGTACGACGC CCTTGAGGTA GAAAACGGCA 50 

CCGAAAAACT GGTGTTGGAA GTTCAGCAAC AGCTGGGCGG TGGCGTGGTT 100 

3 0 CGCTGTATCG CAATGGGGAC CTCCGACGGT CTGCGTCGCG GTCTGAAAGT 150 

GAACAACCTG GACCACCCGA TTGAAGTGCC GGTGGGTAAA GCTACCCTGG 2 00 

GTCGTATCAT GAACGTATTG GGTCAACCGA TCGACATGAA AGGCGACATC 250 

GGCGAAGAAG AGCGTTGGGC GATTCACCGC GCGGCGCCAA GCTACGAAGA 300 

GCTGTCAAGC TCTCAGGAAC TGCTGGAAAC CGGTATCAAG GTAATGGACC 350 

35 TGATTTGTCC GTTCGCCAAG GGCGGTAAAG TCGGTCTGTT CGGCGGTGCG 400 

GGCGTAGGTA AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT 450 

CGAGCACTCC GGTTATTCCG TGTTTGCGGG CGTGGGCGAG CGTACTCGTG 500 

AGGGTAACGA CTTCTACCAC GAAATGACCG ACTCCAACGT TCTGGACAAA 550 

GTATCCCTGG TTTACGGCCA GATGAACGAG CCACCAGGTA ACCGTCTGCG 600 

40 CGTTGCGCTG ACCGGTCTGA CCATGGCGGA GAAATTCCGT GACGAAGGCC 650 

GTGACGTTCT GCTGTTCGTT GACAACATCT ACCGTTACAC CCTGGCCGGT 700 

ACCGAAGTGT CCGCACTTCT GGGCCGTATG CCATCCGCGG TAGGTTATCA 750 

GCCAACGCTG GCGGAAGAGA TGGGCGTTCT GCAAGAACGT ATCACCTCGA 800 

CCAAG 805 

45 

2) INFORMATION FOR SEQ ID NO: 358 

50 ( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odorifera 
60 (B) STRAIN: ATCC 33077 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 

GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 50 

5 AACTCTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTGGTTCGTT 100 

GTATCGCTAT GGGCACCTCC GACGGTTTGC GTCGCGGCCT GAAAGTGAAC 150 

GATCTGCAAC ACCCAATCGA AGTCCCGGTT GGCAAGGCAA CGCTGGGTCG 2 00 

TATCATGAAC GTATTGGGTC AACCAATCGA CATGAAAGGC GACATCGGCG 2 50 

AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCAAGCTA CGAAGAACTG 300 

10 TCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTTA TGGACCTGAT 350 

CTGCCCGTTT GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 4 00 

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCAGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTACTG GACAAGGTTT 550 

15 CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGCCGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 700 

AAGTGTCTGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 800 

20 GACCGGTTCT ATCACCTCCG TA 822 



25 



2) INFORMATION FOR SEQ ID NO: 359 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 05 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Serratia plymuthica 

(B) STRAIN: ATCC 183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 

40 GTGTACAACG CTCTTGAGGT AGAAAACGGT GCCAATAAGC TGGTGCTGGA 50 

AGTTCAGCAA CAGCTGGGCG GTGGCGTGGT TCGCTGTATC GCGATGGGGA 100 

CCTCTGATGG TCTGCGTCGC GGTCTGAAAG TGATCGACCT GGATCACCCG 150 

ATTGAAGTAC CGGTAGGTAA AGCTACCCTG GGCCGTATCA TGAACGTATT 200 

GGGTGAACCA ATCGACATGA AAGGCGACAT CGGCGAAGAA GAACGTTGGG 250 

45 CAATTCACCG TCCAGCGCCA AGCTACGAAG ATTTGGCCAA CTCCCAGGAT 300 

CTGCTGGAAA CCGGTATCAA GGTTATGGAC CTGATCTGTC CGTTCGCTAA 3 50 

GGGCGGTAAA GTGGGTCTGT TCGGCGGTGC GGGCGTGGGT AAAACCGTAA 4 00 

ACATGATGGA GCTGATCCGT AACATCGCGA TCGAACACTC CGGTTATTCC 450 

GTGTTTGCGG GCGTGGGTGA GCGTACTCGT GAGGGTAACG ACTTCTACCA 500 

50 CGAAATGAAC GACTCCAACG TACTGGACAA AGTATCCCTG GTTTACGGCC 550 

AGATGAACGA GCCACCGGGT AACCGTCTGC GCGTTGCTCT GACCGGTCTG 600 

ACCATGGCGG AGAAATTCCG TGACGAAGGC CGCGACGTTC TGCTGTTCGT 650 

TGATAACATC TACCGTTATA CCCTGGCCGG TACCGAAGTG TCCGCACTTC 700 

TGGGCCGTAT GCCATCTGCG GTAGGTTATC AGCCAACGCT GGCGGAAGAG 750 

55 ATGGGCGTTC TGCAAGAACG TATCACCTCT ACCAAGACCG GTTCTATCAC 800 

CTCCG 805 



60 2) INFORMATION FOR SEQ ID NO: 360 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 



15 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 



TTCCCTCAGG ATGCCGTACC AAAAGTGTAC GATGCCCTTG AGGTAGAGAA 50 

CGGTAACGAA AAACTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGCG 100 

TTGTACGCTG TATCGCCATG GGTACTTCCG ACGGTCTGCG TCGCGGTCTG 150 

AAAGTTAACG ACCTCGAGCA CCCAATCGAA GTGCCAGTTG GTAAAGCAAC 200 

2 0 GCTGGGTCGT ATCATGAACG TATTGGGTCA GCCAATCGAC ATGAAAGGCG 250 

ACATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCAAGCTAC 300 

GAAGAGCTGT CCAGCTCCCA AGAGCTGCTG GAAACCGGTA TCAAGGTAAT 350 

GGACCTGATC TGCCCGTTCG CCAAGGGTGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGCGT AGGTAAAAC C GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

25 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAGCGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG 550 

ACAAAGTATC CCTGGTTTAC GGCCAGATGA ACGAGCCGCC GGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGCCGCGAC GTTCTGCTGT TCGTGGATAA CATCTAC CGT TACACCCTGG 700 

3 0- CCGGTACCGA AGTGT CCGCA CTGCTCGGCC GTATGCCATC TGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC 800 

CTCGACCAAG ACCGGTTCAA TCACCTCCGT A 831 



2 ) INFORMATION FOR SEQ ID NO: 3 61 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 406 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(B) STRAIN: LCDC D7172 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361 

CCGTAAACAT GATGGAACTG ATCCGTAACA TCGCCATCGA GCACAGCGGT 50 

TATTCCGTGT TCGCCGGTGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 100 

CTACCACGAG ATGAAGGACT CCAACGTTCT CGACAAGGTA GCGCTGGTCT 150 

55 ACGGTCAGAT GAACGAGCCA CCAGGAAACC GTCTGCGCGT AGCGCTGACC 2 00 

GGCCTGACCA TGGCCGAGAA GTTCCGTGAC GAAGGTAACG ACGTTCTGCT 250 

GTTCGTCGAC AACATCTATC GTTACACCCT GGCCGGTACC GAAGTATCCG 300 

CACTGCTGGG CCGTATGCCT TCGGCGGTAG GTTACCAGCC GACCCTGGCT 350 

GAAGAGATGG GCGTTCTGCA AGAACGTATC ACTTCGACCA AGGAAGGTTC 400 

60 GATCAC 406 
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2) INFORMATION FOR SEQ ID NO: 362 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Shigella boydii 

(B) STRAIN: ATCC 92 07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362 

2 0 TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA 50 

TGGTAATGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTGCGTAC CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGT CTG 150 

GATGTAAAAG ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 2 00 

TCTGGGCCGT ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG 250 

25 AGATCGGTGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC 3 00 

GAAGAGCTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 5 00 

30 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTTGACAA CATCTATCGT T AC A C C C TGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

35 TATCAGCCGA CCCTGGCGGA AGAGATGGGC GTT CTGCAGG AACGTATCAC 800 

CTCCACCAAA ACTGGTTCTA TCACCTCCGT A 831 



40 2 ) INFORMATION FOR SEQ ID NO: 363 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 802 bases 
(B> TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363 



GCCGTACCGC GCGTGTACGA TGCTCTTGCG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCAATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 2 00 

60 CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 2 50 
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AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTCATTC GTAACATCGC GATCGAGCAC 450 

5 TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAC AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600 

CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGACAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

10 TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACC 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CG 802 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 3 64 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 819 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12022 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

35 ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 200 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTC CGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

40 AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

45 GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACTGGTTCTA TCACCTCCG 819 



2) INFORMATION FOR SEQ ID NO: 365 



( i ) SEQUENCE CHARACTERI STICS : 
55 (A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 29930 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

GTACCGCGCG TGTACGATGC TCTTGAGGTG CAAAATGGTA ATGAGCGTCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATCGTG CGTACCATCG 100 

10 CAATGGGTTC CTCCGACGGT CTGCGTCGCG GTCTGGATGT AAAAGACCTC 150 

GAACACCCGA TCGAAGTCCC GGTAGGTAAA GCGACTCTGG GCCGTATCAT 2 00 

GAACGTACTG GGTGAACCGG TCGACATGAA AGGCGAGATC GGTGAAGAAG 250 

AGCGTTGGGC GATTCACCGC GCAGCACCTT CCTACGAAGA GCTGTCAAAC 3 00 

TCTCAGGAAC TGCTGGAAAC CGGTATCAAA GTTATCGACC TGATGTGTCC 350 

15 GTTCGCTAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 4 00 

AAACCGTAAA CATGATGGAG CTCATTCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTACTCTG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGACAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCTCTG 600 

2 0 ACCGGTCTGA CCATGGCTGA GAAATTCCGT GACGAAGGTC GTGACGTTCT 650 

GCTGTTCGTT GACAACATCT ATCGTTACAC CCTGGCCGGT ACGGAAGTAT 7 00 

CCGCACTGCT GGGCCGTATG CCTTCAGCGG TAGGTTATCA GCCGACCCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACTGG 800 
TT 



25 



2 ) INFORMATION FOR SEQ ID NO: 3 66 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366 



802 



TAACGCCTTG GTTATTGATG TGCCTAAAGA AGAAGGTACA ATACAACTAA 50 

CATTAGAAGT TGCGCTGCAA TTAGGTGACG ACGTTGTTCG TACAATTGCG 100 

45 ATGGATTCAA CTGATGGTGT CCAAAGAGGC ATGGATGTAA AAGATACAGG 150 

CAAAGAAATT AGTGTACCTG TTGGTGATGA AACATTAGGT CGTGTATTTA 2 00 

ATGTACTAGG TGAAACAATT GACCTTAAAG AAGAAATTAG TGATTCTGTT 250 

CGCCGCGATC CTATCCATCG TCAAGCACCA GCATTCGATG AACTTTCAAC 300 

AGAAGTTCAA ATTTTAGAAA CAGGTATTAA AGTAGTAGAT TTACTAGCAC 350 

50 CTTATATTAA AGGTGGTAAA ATCGGATTGT TCGGTGGTGC CGGTGTAGGT 4 00 

AAAACAGTAT TAATCCAAGA ATTAATTAAC AACATCGCTC AAGAGCACGG 450 

TGGTATTTCT GTATTCGCCG GTGTAGGTGA ACGTACTCGT GAAGGTAACG 500 

ATTTATACTT CGAAATGAGT GATAGTGGTG TAATTAAGAA AACAGCCATG 550 

GTATTCGGGC AAATGAATGA GCCACCTGGT GCACGTATGC GTGTTGCATT 600 

55 ATCTGGTTTA ACAATGGCTG AATATTTCCG TGACGAACAA GGTCAAGACG 650 

TATTATTATT CATCGATAAC ATTTTCAGAT TTACACAAGC TGGTTCTGAG 700 

GTATCTGCAT TATTAGGTCG TATGCCTTCT GCAGTAGGTT ACCAACCAAC 750 

ACTTGCTACT GAAATGGGAC AATTACAAGA ACGTA 785 



60 
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2 ) INFORMATION FOR SEQ ID NO : 367 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN: ATCC 33 753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 

GAACATAACG AAGTGCCTAA TATTAATAAC GCCTTAGTAC TCGATGTTGA 50 

AAGAGAAGAC GGAACAGTGT CTTTAACTTT AGAAGTAGCT TTACAATTAG 100 

GCGATGACGT TGTTCGTACC ATTGCAATGG ATTCAACTGA TGGTGTTAAA 150 

CGTGGTAACG AAGTCAAAGA TACTGGTAAT AGCATTAGCG TACCAGTCGG 2 00 

AGACGAAACT TTAGGACGTG TCTTCAACGT TCTAGGTGAA ACAATTGATT 2 50 

TAGAAGATAA ACTTGATGAT TCTGCGCGAC GTGACCCTAT ACATAGAGAA 3 00 

GCGCCAGCGT TTGATCAATT ATCAACTCAA GTTGAAATTT TAGAAACAGG 350 

AATTAAAGTT GTTGACTTAT TAGCACCTTA TATTAAAGGT GGTAAAGTTG 4 00 

GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTTTAAT CCAAGAATTA 450 

ATCAACAACA TCGCTCAAGA ACACGGTGGT ATTTCAGTCT TTGCCGGTGT 500 

AGGTGAACGT ACACGTGAAG GTAACGACTT GTACTATGAA ATGAGCGACA 550 

GTGGTGTAAT CAAGAAAACA GCCATGGTCT TCGGACAAAT GAACGAACCA 600 

CCTGGCGCAC GTATGCGTGT TGCTTTATCT GGTTTAACAA TGGCTGAATA 650 

TTTCCGTGAT GAACAAGGAC AAGACGTATT GTTATTCATC GACAATATTT 700 

TCCGTTTCAC ACAAGCCGGT TCAGAAGTTT CTGCCTTACT AGGTCGTTTA 750 

CCATCAGCCG TTGGTTATCA ACCTACATTA GCAACAGAAA TGGGACAATT 800 

ACAAGAACGT ATTACTTCAA CAACAAAAGG ATCAGTTACT TCA 843 



2 ) INFORMATION FOR SEQ ID NO: 3 68 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 849 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 68 

GCTTTGAACA TAATGAAGTT CCTGATATTA ACAATGCCTT ACACATCGAA 50 

GTTCCTAAAG AAGATAGCAC ACTTCATTTA ACTTTAGAAG TTGCACTTCA 100 

ATTAGGTGAC GATGTAGTAC GTACAATCGC AATGGACTCA ACTGACGGCG 150 

TTCAAAGAGG TATGGAAGTT AAAGATACAG GTAAAGATAT TAGCGTACCT 200 

GTTGGTGATG CAACTTTAGG AAGAGTATTT AACGTATTAG GAGAAACAAT 250 

CGATTTAGAT GAAAAGATTG ATGATTCAGT ACGTCGTGAT CCTATTCATA 300 

GACAGGCACC TGGCTTCGAT GAATTATCTA CTAAAGTAGA AATCTTAGAA 350 
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ACAGGTATCA AAGTAGTAGA CTTATTAGCA 
AATTGGATTA TTCGGTGGTG CCGGTGTTGG 
AACTTATCAA TAATATCGCT CAAGAGCATG 
GGTGTTGGTG AACGTACACG TGAAGGTAAC 
5 CGATAGTGGT GTAATTAAGA AAACAGCGAT 
AGCCACCTGG TGCTCGTATG CGTGTTGCAT 
GAATATTTCC GTGATGAAGA AGGCCAAGAC 
TATCTTCAGA TTCACACAAG CTGGTTCTGA 
GTATGCCTTC AGCCGTTGGT TATCAACCAA 
10 CAATTACAAG AACGTATTAG TTCAACTAAT 



CCTTACATTA AAGGTGGTAA 4 0CT 

TAAGACAGTT TTAATCCAAG 45 0 

GTGGTATTTC AGTATTCGCC 500 

GACCTTTACT ATGAAATGAG 550 

GGTATTCGGT CAGATGAACG 6 00 

TATCAGGTTT AACAATGGCA 65 0 

GTATTATTAT TCATTGATAA 70 0 

AGTTTCAGCA TTACTTGGAC 750 

CACTTGCTAC TGAAATGGGT 80 0 

AAAGGTTCTG TTACTTCAA 849 



15 



2) INFORMATION FOR SEQ ID NO: 3 69 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Staphylococcus cohnii 

(B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369 

3 0 GAAGTTCCAG AAATTAATAA TGCCTTAGTT CTCGATATAG AAAGAGAAGA 50 

AGGTACTGTT GAATTAACGT TAGAAGTTGC ATTACAACTT GGTGATGACG 100 

TAGTACGAAC AATCGCAATG GATTCAACTG ATGGTGTTAA ACGTGGTACA 150 

GAAGTTAGAG ATAGCGGAAA TAGTATCAGC GTACCAGTTG GTAATGAAAC 2 00 

ATTAGGTAGA GTATTTAATG TATTAGGTGA TACGATAGAT TTAGATGAAG 250 

3 5 ACATAGATGA CTCAGTGCGT CGTGACCCAA TTCATAGAGA AGCACCTGCA 300 

TTTGATCAGT TATCTACTAA AGTTGAAATT TTAGAAACAG GTATCAAAGT 350 

CATTGATTTA TTAGCACCAT ATATCAAAGG TGGTAAAGTT GGATTATTCG 40 0 

GTGGTGCCGG TGTTGGTAAA ACTGTATTAA TTCAAGAATT AATCAATAAT 4 50 

ATCGCTCAAG AGCATGGTGG TATATCCGTA TTTGCTGGTG TAGGTGAGCG 500 

40 TACGCGTGAA GGTAATGACC TATACTTTGA AATGAGTGAT AGTGGTGTTA 550 

TTAAAAAGAC AGCTATGGTA TTTGGACAAA TGAACGAACC ACCTGGTGCG 600 

CGTATGCGAG TAGCACTTTC TGGTTTAACA ATGGCTGAAT ATTTCCGGGA 65 0 

TGAACAAGGA CAAGATGTTC TATTATTCAT AGATAACATC TTTAGATTTA 70 0 

CTCAAGCTGG TTCAGAAGTT TCTGCGTTAT TAGGTCGTAT GCCTTCAGCT 7 50 

45 GTTGGTTACC AACCAACGTT AGCAACTGAA ATGGGACAAT TACAAGAACG 8 00 

TATTACTTCT ACAACTAAAG GTTCAGTAAC 830 



50 2) INFORMATION FOR SEQ ID NO: 370 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus epidermidz's 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 

5 

AATAATGCAT TACACATCGA AGTTCCTAAA GAAGATGGAG CGCTTCAATT 50 

AACATTAGAA GTTGCACTTC AACTAGGTGA CGATGTAGTT CGTACAATTG 100 

CAATGGACTC AACTGACGGC GTTCAAAGAG GAATGGAAGT TAAAGATACA 150 

GGTAGAGACA TAAGTGTACC TGTCGGTGAC GTAACTCTAG GAAGAGTGTT 2 00 

10 TAACGTACTA GGAGAAACTA TTGACTTAGA TGAAAAAATT GATGATTCAG 250 

TACGACGTGA CCCTATCCAT AGACAAGCTC CAGGATTCGA CGAATTATCA 300 

ACAAAAGTAG AAATCTTAGA AACTGGTATT AAAGTAGTAG ACTTATTAGC 350 

ACCTTACATA AAAGGTGGTA AAATTGGATT ATTTGGTGGT GCCGGTGTAG 4 00 

GTAAAACCGT ACTAATCCAA GAACTTATTA ATAACATCGC TCAAGAACAC 4 50 

15 GGTGGTATCT CAGTATTCGC TGGTGTTGGT GAACGTACAC GTGAAGGTAA 50 0 

TGATCTTTAC TATGAAATGA GTGACAGTGG TGTTATCAAG AAAACTGCAA 550 

TGGTCTTTGG TCAAATGAAT GAGCCACCTG GTGCACGTAT GCGTGTAGCA 600 

TTATCCGGAT TAACAATGGC CGAATATTTC CGAGATGAAG AAGGCCAAGA 650 

TGTGTTATTA TTCATTGATA ACATTTTCAG ATTCACTCAA GCTGGTTCAG 70 0 

20 AAGTTTCTGC GTTATTAGGT CGTATGCCAT CAGCTGTTGG TTATCAACCT 750 

ACACTTGCTA CAGAAATGGG TCAATTACAA GAACGTA 787 



25 2) INFORMATION FOR SEQ ID NO: 371 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 



40 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 



GAAGTACCTG AAATTAATAA CGCCTTAATC ATCGAAGTTC CCAAAGAAGA 50 

TGGTACTTTT GAATTAACGC TTGAAGTTGC ATTACAACTA GGTGATGACG 100 

TTGTTCGTAC AATTGCTATG GATTCAACAG ATGGTGTTCA ACGTGGTATG 150 

GAAGTTCAGA ACACTGGAAA AGACATTTCA GTACCAGTTG GCGAAGTAAC 200 

45 TTTAGGACGT GTATTTAACG TATTAGGTGA CACAATTGAT TTAGAAGATA 250 

AATTAGATGG TTCAGTAAGA CGTGATCCAA TTCATAGACA ATCACCTAAC 300 

TTTGACGAAT TATCTACTGA AGTAGAAATT CTTGAAACTG GAATCAAAGT 350 

TGTAGACTTA TTAGCACCAT ACATCAAAGG TGGTAAAATC GGTCTATTTG 4 00 

GTGGTGCCGG TGTTGGTAAA ACCGTTTTAA TCCAAGAATT GATTAATAAT 450 

50 ATCGCACAAG AACATGGTGG TATCTCAGTA TTTGCTGGTG TAGGTGAACG 500 

TACACGTGAA GGTAACGACC TATATTATGA AATGAGAGAT AGTGGTGTTA 550 

TTAAGAAAAC AGCAATGGTA TTTGGTCAAA TGAACGAGCC ACCTGGTGCA 6 00 

CGTATGCGTG TGGCACTTTC TGCATTGACA ATGGCTGAGT ATTTCCGTGA 65 0 

TGAACAAGGA CAAGACGTTC TGTTATTCAT CGATAACATT TTCAGATTTA 700 

55 CTCAAGCAGG TTCAGAAGTA TCAGCATTAT TGGGACGTAT GCCTTCAGCT 750 

GTAGGTTATC AACCTACTTT AGCTACAGAA ATGGGTCAAT TACAAGAACG 800 

TATTACATCA ACGAATAAAG GTTCAGTAAC 83 0 
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2) INFORMATION FOR SEQ ID NO: 372 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus homlnis subsp. hominis 

(B) STRAIN: ATCC 2 7844 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372 

TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT TAAA.TTAACA TTAGAAGTTG CATTGCAACT 100 

AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 15 0 

2 0- AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 2 00 

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 25 0 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 3 00 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350 

GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 4 00 

25 TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450 

TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 50 0 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 60 0 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

30 TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 80 0 

TTACAAGAAC GTATTACATC T AC T AAT AAA GGTTCAGTCA CTTCAA 846 



35 



50 



2) INFORMATION FOR SEQ ID NO: 373 



( i ) SEQUENCE CHARACTERI STI CS : 
4 0 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus homlnis 

(B) STRAIN : CSG 175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373 



TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT CAAATTAACA TTAGAAGTTG CATTGCAACT 100 

55 AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 200 

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 250 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350 

60 GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 4 00 
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TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450 

TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 800 

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 846 



2) INFORMATION FOR SEQ ID NO: 374 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 
25 (B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374 

ATAATGAAGT GCCTGAAATA AATAATGCGC TCATTGTTGA AATTCCTAAA 50 

30 AGTGATACAA CAATCAGTTT AACACTTGAA GTTGCTTTGC AATTAGGTGA 100 

CGATGTTGTA CGTACTATTG CAATGGATTC AACTGATGGC GTTCAACGTG 150 

GTATGGAAGT TCAAAACACA GGTAAAGACA TCAGTGTACC TGTTGGAGAT 200 

GAAACATTAG GAAGAGTATT TAACGTTTTA GGAGAATCTA TTGATTTAGA 2 50 

AGAAAAGCTA GATGACTCTG TGCGTAGAGA TCCAATTCAT AGACTAGCAC 3 00 

35 CTAAATTTGA TGAATTATCT ACAGAAGTAG AAATTCTTGA AACTGGTATT 3 50 

AAAGTTGTTG ATTTATTAGC ACCATATATT AAAGGTGGTA AAGTTGGATT 4 00 

GTTTGGTGGT GCCGGAGTAG GTAAAACGGT ATTAATTCAA GAATTAATCA 4 50 

ACAATATTGC TCAAGAACAT GGTGGTATTT CTGTGTTTGC CGGAGTAGGT 500 

GAACGTACAC GTGAAGGTAA TGACTTATAT TATGAAATGA GCGATAGTGG 550 

4 0 CGTAATTAAG AAAACAGCGA TGGTATTTGG CCAAATGAAT GAACCACCTG 600 

GTGCACGTAT GAGAGTTGCG TTATCTGCCT TAACAATGGC TGAATATTTC 650 

CGTGACGAGC AAGGACAAGA CGTATTGCTG TTTATCGATA ATATATTCCG 700 

TTTTACACAA GCAGGTTCAG AAGTATCTGC ATTACTTGGA CGTATGCCAT 750 

CTGCCGTTGG TTATCAACCA ACATTGGCTA CAGAAATGGG ACAATTGCAA 800 

45 GAAAGAATTA CATCTACAAA TAAAGGTTCT GTAAC 835 



2) INFORMATION FOR SEQ ID NO: 375 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Staphylococcus saprophyticus 

224 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 
(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375 

GAGCACAATG AAGTTCCAGA AATTAACAAT GCCTTAGTCG TAGACGTTGA 50 

AAGAGATGAA GGTACAGTAT CTCTTACATT AGAAGTGGCA TTACAACTTG 100 

GCGATGATGT CGTACGTACA ATTGCAATGG ATTCTACTGA TGGTGTTAAA 150 

CGTGGTACAG AAGTTCGAGA TAGCGGAGAT AGCATCAGTG TTC CAGTTGG 2 00 

TGATGCTACG TTAGGACGTG TGTTTAATGT TCTTGGTGAT ACAATTGACT 250 

TAGACGAGAA GCTTGATACT TCTGTCAAAC GTGATCCAAT TCATAGAGAA 300 

GCACCTGCAT TCGATCAATT ATCAACAAAA GTTGAAATCT TAGAAACAGG 350 

TATTAAAGTA ATTGATTTAC TTGCACCATA TATTAAAGGT GGTAAAATCG 400 

GTTTATTCGG TGGCGCTGGT GTAGGTAAAA CAGTATTAAT TCAAGAATTA 450 

ATTAATAATA TAGCTCAAGA ACATGGTGGT ATTTCAGTAT TTGCCGGCGT 500 

AGGTGAACGT ACGCGTGAAG GTAATGACTT ATACTACGAA ATGAGTGATA 550 

GTGGTGTTAT TAAGAAAACA GCTATGGTCT TCGGACAAAT GAATGAGCCA 600 

CCTGGTGCGC GTATGCGTGT TGCTTTATCA GGCTTAACAA TGGCTGAACA 650 

CTTCCGTGAT GTACAAGGAC AAGATGTTTT ACTATTTATT GATAACATAT 700 

TCAGATTTAC GCAAGCTGGT TCAGAAGTAT CAGCACTATT AGGTCGTATG 750 

CCATCAGCCG TTGGTTATCA ACCTACCCTT GCTACTGAAA TGGGTCAATT 800 

ACAAGAACGT ATTACATCAA CAACTAAAGG ATCTGTAACG TC 842 



2) INFORMATION FOR SEQ ID NO : 376 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 842 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA . 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376 

TGATGAACTG CCTAAGATTA ATAACGCATT AGTGCTAGAT GTACCTAAGA 50 

AAGATGGCAC GACTGAATCT CTTACATTAG AAGTAGCACT TGAATTAGGC 100 

GACGACGTAG TTAGAACTAT CGCCATGGAC TCTACAGACG GAATTAAACG 150 

TGGTGACGAC GTTAAAGACA CTGGTCGTCC AATCAGTGTA CCTGTCGGTG 200 

AAGATACGTT AGGAAGAGTA TTTAACGTTT TAGGTGATCC AATCGATAAT 250 

GATGGACCGA TTTCTGAATC AGTTCCACGT GAACCAATTC ATAGACAACC 3 00 

ACCTAAATTT GATGAATTAT CAACAAAAGT TGAACTACTT GAAACTGGTA 350 

TCAAAGTAGT AGACTTATTA GCACCATATA TCAAAGGTGG TAAAGTTGGT 400 

TTATTCGGTG GTGCCGGAGT AGGTAAAACT GTATTAATCC AAGAATTAAT 450 

TAATAACATC GCTCAAGAAC ACGGCGGTAT TTCAGTATTC GCAGGTGTTG 500 

GTGAACGTAC ACGTGAAGGT AACGACTTGT ACTTCGAAAT GAGCGACAGT 550 

GGTGTTATCA AGAAAACAGC GATGGTATTC GGACAAATGA ACGAACCACC 600 

TGGTGCACGT ATGCGTGTAG CTTTATCAGG TTTAACAATG GCTGAATACT 650 

TCCGTGATGT TAAAGGACAA GACGTTCTTT TATTCATCGA TAACATTTTC 700 

CGCTTCACAC AAGCAGGTTC TGAGGTATCA GCATTGCTTG GCCGTATGCC 750 

ATCAGCCGTT GGTTACCAAC CAACATTGGC AACAGAAATG GGTCAATTAC 800 

AAGAACGTAT CACTTCTACA ATGAAAGGTT CTATCACATC TA 842 
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2 ) INFORMATION FOR SEQ ID NO: 377 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 841 bases 

(B) TYPE : Nucleic acid 
<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377 

CATAACGAAG TCCCTGATAT TAATAATGCC CTTATTATTG AAGTTCCAAA 50 

AGAAGATGGA ACGTTAAACT TAACATTAGA AGTTGCACTA CAATTAGGTG 100 

ATGATGTTGT ACGTACAATT GCAATGGATT CAACTGATGG TGTTCAAAGA 150 

20 GGCATGGATG TTAAAGACAC AGGTAAAGAT ATTAGTGTAC CTGTAGGCGA 2 00 

TGAAACGCTT GGAAGAGTGT TTAATGTACT AGGTGAAACA ATTGACTTGG 250 

AAGAGAAAAT TGATGATTCC GTACGTCGTG ATCCAATCCA TAGACAATCA 3 00 

CCAGGTTTCG ATGAATTATC TACTGAAGTA GAAATCTTAG AAACAGGTAT 3 50 

TAAAGTAGTA GACTTATTAG CACCTTACAT TAAAGGTGGT AAAGTTGGAC 4 00 

25 TATTCGGTGG TGCCGGAGTA GGTAAAACCG TTTTAATCCA AGAATTAATT 4 50 

AACAATATTG CACAAGAACA TGGTGGTATT TCAGTATTCG CGGGTGTAGG 500 

TGAACGTACT CGTGAAGGTA ATGATTTATA CTATGAAATG AGTGATAGTG 550 

GTGTAATTAA GAAAACAGCG ATGGTATTTG GACAAATGAA TGAACCACCT 600 

GGCGCACGTA TGCGTGTAGC TTTATCTGGT TTAACTATGG CTGAATACTT 650 

3 0 CCGTGATGAA CAAGGACAAG ACGTACTTTT ATTCATCGAT AATATTTTCA 700 

GATTTACACA AGCTGGTTCT GAAGTTTCTG CATTACTTGG TCGTATGCCT 750 

TCAGCCGTTG GTTACCAACC AACATTAGCA ACTGAAATGG GTCAATTACA 800 

AGAACGAATT ACATCTACAA ATAAAGGTTC TGTAACATCT A 841 
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2) INFORMATION FOR SEQ ID NO: 378 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 

(B) STRAIN: ATCC 51726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 

TTTAACACGA ATGAACCGCT TCCTGAGATA AATAATGCAC TTGTTGTTTA 50 

CAAAGACAGT GAGAAAAAAC ATAAAATCGT TCTTGAAGTA GCTCTTGAAC 100 

55 TTGGTGAAGG CCTCGTTCGT ACCATTGCTA TGGAATCAAC TGATGGTTTG' 150 

ACACGTGGTC TAGAAGTTCT TGATACAGGC CGTGCAATCA GTGTACCAGT 200 

TGGTAAAGAA ACGCTTGGAC GTGTCTTCAA CGTTCTTGGT GATGCTATCG 250 

ATCTTGAAGA ACCATTTGGA GAAGATGCAG AACGTCACCC CATTCATAAG 3 00 

AGTGCCCCAA CTTTTGATGA ATTATCAACG TCAACAGAAA TCCTTGAAAC 350 

60 AGGGATTAAA GTTATCGACC TACTTGCCCC TTACTTAAAA GGAGGGAAGG 4 00 
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TTGGACTTTT CGGTGGTGCC GGAGTTGGTA AGACCGTTCT TATCCAAGAG" 4*5"u 

TTGATTCATA ACATTGCTCA AGAGCATGGT GGTATTTCAG TATTTACCGG 500 

AGTTGGTGAA CGTACACGTG AAGGTAATGA CCTCTATTGG GAAATGAAAG 550 

AATCAGGCGT TATTGAAAAA ACAGCTATGG TATTTGGTCA GATGAATGAG 600 

CCACCTGGTG CACGTATGCG TGTAGCCCTT ACTGGTTTGA CAATCGCTGA 650 

ATATTTCCGT GATGTTGAAG GACAGGACGT GCTTCTCTTT ATTGATAACA 700 

TTTTTCGTTT CACACAAGCA GGTTCTGAAG TTTCAGCTCT TCTTGGACGT 750 

ATGCCATCAG CCGTTGGTTA TCAACCAACC TTGGCAACTG AAATGGGTCA 800 

ATTGCAAGAA CGTATCACGT CAACTAAAAA AGGTTCTGTT ACATCA 846 



2) INFORMATION FOR SEQ ID NO: 3 79 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Streptococcus agalactiae 
25 (B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

3 0 AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTA CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

35 AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

4 0 ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

45 TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 



2) INFORMATION FOR SEQ ID NO: 380 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus agalactiae 
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(B) STRAIN: ATCC 13813 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380 

5 TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 5 0 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGC CGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

10 TCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 35 0 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

15 GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 65 0 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

20 TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 84 6 



25 2 ) INFORMATION FOR SEQ ID NO: 381 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 84 5 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12973 



40 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GTGCAATTAG TGTGCCGGTT 200 

45 GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCACCATC ATTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

50 TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 60 0 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

55 CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCA 845 
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2) INFORMATION FOR SEQ ID NO: 382 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA - 150 

2 0 CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

25 TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATC CACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

AT CTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

3 0 TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 
TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 8 00 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 845 



35 



50 



2) INFORMATION FOR SEQ ID NO: 383 



(i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDC SS1073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

55 TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350 

60 GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 
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TGGACTTTTC GGTGGTGCAG 
TAATCCACAA CATCGCCCAA 
GTAGGAGAAC GTACTCGTGA 
ATCTGGCGTT ATTGAAAAAA 
5 CACCAGGAGC ACGTATGCGT 
TACTTCCGTG ATGTAGAAGG 
CTTCCGTTTC ACACAAGCTG 
TGCCTTCAGC CGTTGGTTAT 
TTACAAGAGC GTATCACTTC 



GTGTTGGTAA AACCGTTCTT 
GAACATGGTG GTATTTCAGT 
AGGGAATGAC CTTTATTGGG 
CGGCTATGGT CTTTGGTCAA 
GTGGCACTTA CTGGTCTTAC 
ACAAGATGTG CTTCTCTTCA 
GGTCAGAAGT GTCAGCGCTT 
CAACCAACAC TTGCTACAGA 
AACTAAAAAA GGTTCTGTTA 



PCT/CA00/01150 

ATTCAAGAA^ 450 

ATTTACTGGT 500 

AAATGAAAGA 550 

ATGAATGAAC 600 

AATAGCTGAG 650 

TTGATAATAT 70 0 

TTAGGTCGTA 750 

AATGGGACAA 800 

CATCA 845 



2) INFORMATION FOR SEQ ID NO: 3 84 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus dysgalactiae 
25 (B) STRAIN: ATCC 43078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384 

TTGCTAGTGG GGACAAACTT CCAGAGATTA ATAATGCATT GATTGTTTAT 50 

30 AAAGATAGTG ATAAAAAGCA AAAAATCGTC CTTGAAGTTG CTCTGGAACT 100 

TGGTGACGGT ATGGTGCGAA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGGTT AGAAGTTCTT GACACTGGTC GTGCGATTAG TGTACCAGTA 200 

GGTAAAGAAA CTTTGGGACG CGTCTTTAAT GTACTTGGAG AAACCATTGA 250 

CTTGGAAGAA CCATTTGCAG AAGACGTTGA CCGTCAGCCA ATCCATAAAA 3 00 

35 AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT TCTTGAAACT 350 

GGTATCAAGG TAATTGACCT TCTTGCCCCT TACCTTAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGGGGTGCCG GAGTTGGTAA GACTGTCCTT ATCCAAGAAT 450 

TAATCCACAA TATCGCCCAA GAACACGGAG GTATTTCAGT ATTTACCGGT 500 

GTTGGTGAGC GAACACGTGA AGGAAATGAC CTTTACTGGG AAATGAAAGA 550 

40 ATCAGGCGTT ATTGAGAAAA CTGCCATGGT TTTTGGTCAG ATGAATGAGC 600 

CGCCTGGGGC ACGTATGCGT GTAGCCCTTA CTGGTTTAAC CATTGCTGAG 650 

TATTTCCGTG ATGTAGAAGG CCAAGATGTT TTGCTCTTTA TTGATAATAT 700* 

CTTCCGTTTC ACTCAGGCAG GTTCAGAAGT ATCAGCCCTC TTAGGCCGTA 750 

TGCCTTCTGC TGTTGGTTAC CAACCGACCC TTGCTACTGA AATGGGACAA 800 

45 TTGCAAGAAC GTATTACGTC AACTCAAAAA GGATCTGTTA CTTCT 845 



2) INFORMATION FOR SEQ ID NO: 385 



50 



( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus egui subsp. egui 
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(B) STRAIN: ATCC 952 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385 

5 TTGCGAGTGG GGACAAACTA CCAGAGATTA ATAATGCGTT GATAGTTTAT 50 

AAAGATGGCG ATAAAAAGCA AAAAATCGTT CTCGAGGTTG CCCTAGAGCT 100 

TGGAGACGGT ATGGTACGTA CAATTGCTAT GGAATCAACC GATGGGCTTA 150 

CACGTGGATT AGAGGTTCTT GATACTGGTC GTGCCATTAG TGTACCAGTT 2 00 

GGTAAAGAGA CTCTAGGTCG TGTTTTCAAC GTTCTTGGTG AAACCATCGA 250 

10 CCTAGAAGCA C C ATTTGC AG ATGATGTTAA TCGTGAACCG ATCCATAAAA 3 00 

AAGCACCAGC CTTTGATGAA TTGTCAACAT CATCAGAAAT TCTTGAAACA 3 50 

GGTATCAAGG TTATTGACCT GCTTGCCCCT TACTTAAAGG GTGGTAAGGT 4 00 

CGGTCTTTTC GGTGGTGCCG GAGTTGGTAA AACCGTTCTT ATCCAAGAAT 450 

TAATCCACAA TATCGCTCAA GAGCATGGTG GGATCTCGGT ATTTACCGGT 500 

15 GTTGGTGAGC GTACGCGTGA AGGAAATGAC CTTTACTGGG AAATGAAGGA 550 

ATCAGGCGTT ATTGAAAAAA CAGCCATGGT TTTTGGTCAG ATGAATGAAC 600 

CACCAGGAGC C CGT ATGCGT GTTGCCTTGA CCGGCTTGAC AATTGCTGAA 650 

TATTTCCGCG ATGTTGAAGG CCAAGACGTC CTGCTCTTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCAG GCTCAGAGGT ATCAGCCCTT CTAGGTCGTA 750 

20 TGCCTTCAGC CGTTGGTTAC CAGCCAACAC TTGCCACTGA AATGGGACAA 800 

TTGCAAGAGC GTATCACCTC AACGAAAAAA GGCTCTGTTA CCTCTA 84 6 



25 2) INFORMATION FOR SEQ ID NO: 386 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386 



TTGCAGCTGG TGATAAATTA CCTGAGATAA ATAATGCACT TGTAGTCTAT 50 

AAAAATGACG AAAATAAATC AAAAATCGTC CTTGAAGTAG CTCTTGAGCT 100 

TGGTGATGGA GTGGTTCGAA CTATTGCCAT GGAATCCACT GATGGGTTGA 150 

CTCGTGGCAT GGAAGTTCTA GATACTGGTC GACCAATTTC TGTTCCAGTT 200 

45 GGGAAAGAAA CACTTGGTCG CGTCTTTAAC GTTTTAGGCG ATACCATTGA 250 

TTTGGATACT CCATTCGGCG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCTCCAAC TTTTGATGAG TTATCTACTT CATCAGAAAT CTTAGAAACA 350 

GGAATAAAGG TTATTGACCT TTTAGCCCCC TACCTCAAAG GTGGGAAAGT 400 

CGGCCTCTTC GGTGGTGCTG GCGTTGGGAA AACTGTCTTG ATTCAAGAGT 450 

50 TGATTCATAA TATCGCCCAA GAACACGGCG GGATTTCAGT CTTTACTGGT 500 

GTTGGGGAAC GAACTCGTGA AGGGAATGAC CTGTACTGGG AAATGAAAGA 550 

ATCTGGTGTT ATCGAAAAGA CGGCTATGGT CTTTGGGCAA ATGAATGAAC 600 

CGCCTGGAGC ACGTATGCGT GTAGCTTTGA CTGGGTTAAC GATTGCAGAG 650 

TATTTCCGTG ATGTGGAAGG TCAAGATGTT CTTTTGTTTA TTGATAATAT 700 

55 TTTCCGTTTC ACTCAAGCTG GTTCTGAAGT GTCAGCCCTT CTTGGTCGTA 750 

TGCCATCAGC TGTTGGTTAC CAACCAACCT TGGCTACTGA AATGGGGCAA 800 

TTACAAGAAC GTATTACATC AACGAAAAAA GGTTCTGTTA CCTCAA 846 



60 
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2) INFORMATION FOR SEQ ID NO: 3 87 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387 

GCAGCTGGTG ATAAACTTCC TGAGATTAAC AATGCATTGG TCGTTTATAC 50 

TGATGAACAA AAGTCTAAAC GTATCGTGCT CGAAGTAGCT CTTGAACTTG 100 

GAGAAGGTGT GGTTCGTACC ATTGCCATGG AATCTACTGA TGGATTGACT 150 

CGTGGACTAG AAGTTCTGGA CACTGGTCGT CCAATCAGCG TTCCTGTTGG 200 

TAAAGATACC CTTGGACGTG TCTTTAACGT TCTTGGTGAT ACCATTGACT 250 

TGGAAGCACC TTTTGCAGAC GATGCAGAGC GTGAACCAAT TCACAAAAAA 300 

GCACCAACTT TCGATGAATT GTCAACATCT ACTGAAATCC TTGAAACAGG 3 50 

GATTAAAGTT ATCGACTTGC TAGCCCCTTA CCTTAAGGGT GGTAAAGTCG 400 

GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTCTTAT TCAAGAGTTG 450 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCCGTGT TTACAGGTGT 500 

TGGTGAACGT ACACGTGAAG GTAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CTGGCGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAAAT GAACGAACCA 600 

CCTGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCGGAATA 650 

CTTCCGTGAT GTCGAGGGTC AAGACGTTCT TCTCTTCATC GATAACATCT 700 

TCCGTTTCAC TCAAGCAGGT TCTGAGGTTT CTGCCCTTCT TGGTCGTATG 750 

CCATCAGCCG TTGGTTACCA ACCTACACTT GCTACTGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CTAAAAAAGG TTCTGTTACA TCT 843 



2) INFORMATION FOR SEQ ID NO: 388 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC. 43765 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388 

TTGCAGCAGA AGATAAACTT CCTGAGATTA ACAACGCACT CGTTGTATAT 50 

AAAAATGATG ATTCCAAACA AAAAGTCGTG CTTGAAGTGG CTTTGGAACT 100 

TGGTGATGGC GTTGTACGGA CCATTGCCAT GGAATCAACG GATGGATTGA 150 

CACGTGGGAT GGAAGTTCTC GATACAGGTC GTCCCATCTC TGTTCCAGTC 200 

GGTAAAGAAA CGCTGGGTCG TGTCTTCAAT GTGTTGGGAG ATACCATTGA 250 

CCTTGAAGAG TCTTTTCCGG CAGATTTTGA ACGTGAGCCT ATCCATAAGA 300 

AAGCGCCGGC TTTTGACGAA TTATCTACTT CAAGCGAAAT TTTGGAAACA 350 

GGGATTAAGG TTATCGACCT CCTAGCACCT TATCTAAAAG GTGGTAAGGT 400 
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TGGTCTCTTC GGTGGTGCTG GTGTTGGTAA AACCGTTCTT ATCCAAGAAT 4 50 

TGATTCACAA TATTGCCCAA GAACACGGTG GTATCTCTGT ATTTACCGGA 50 0 

GTTGGCGAGC GTACCCGTGA AGGGAACGAT CTTTACTGGG AAATGAAAGA 550 
ATCAGGTGTT ATTGAAAAAA CGGCCATGGT ATTTGGTCAG ATGAATGAGC . 600 

5 CACCAGGAGC CCGTATGCGT GTTGCTCTTA CTGGTTTGAC TATTGCGGAA 650 

TACTTCCGTG ATGTGGAAGG GCAGGATGTT CTTCTGTTCA TCGATAATAT 700 

CTTCCGTTTC ACACAGGCTG GTTCAGAAGT GTCTGCCCTC TTGGGTCGTA 750 

TGCCATCAGC CGTTGGTTAT CAGCCAACAC TTGCGACGGA GATGGGACAA 800 

TTGCAGGAGC GTATTACCTC AACCAAGAAG GGTTCTGTTA C 841 



2) INFORMATION FOR SEQ ID NO : 389 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uberis 
25 (B) STRAIN: ATCC 19436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389 

GCAAACGGTG AAAAATTACC AGAGATTAAT AATGCATTGA TAGTTTATAA 50 

3 0 AGGTAGCGAT AAAAAACAAA AGATTGTTCT TGAAGTTGCT TTGGAACTTG 10 0 

GGGACGGAAT GGTTCGTACA ATCGCTATGG AATCAACTGA TGGGCTTACA 150 

CGTGGATTAG AAGTTTTAGA TACTGGCCGT GCCATTAGTG TACCAGTCGG 2 00 

AAAAGAAACT TTGGGTCGTG TTTTCAATGT GCTTGGTGAA ACCATTGATT 250 

TGGATGAACC ATTTGCCGCT GATGCTGCAA GAGAACCCAT CCATAAAAAA 3 00 

35 GCCCCAGCAT TTGATGAACT ATCAACGTCT TCAGAAATTC TTGAAACCGG 35 0 

AATAAAAGTT ATTGACTTAT TAGCCCCTTA TCTCAAAGGT GGTAAAGTTG 400 

GTTTATTTGG TGGTGCCGGA GTAGGTAAAA CGGTTTTAAT TCAAGAATTA 450 

ATTCATAATA TTGCACAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGTGAAAGA ACTCGTGAAG GTAATGACCT TTATTGGGAA ATGAAAGAAT 550 

4 0 CTGGCGTTAT TGAAAAAACA GCCATGGTAT TTGGACAAAT GAACGAACCA 600 

CCAGGAGCAC GTATGCGCGT TGCTTTAACA GGTTTAACCA TTGCTGAATA 650 

TTTCCGGGAT GTTGAAGGTC AAGATGTTTT GCTCTTTATT GACAACATTT 700 

TCCGTTTCAC GCAAGCTGGT TCAGAAGTTT CAGCCCTATT GGGTCGTATG 750 

CCTTCAGCGG TAGGATACCA ACCAACACTT GCTACCGAAA TGGGACAATT 8 00 

45 GCAAGAAAGA ATTACCTCAA CTAACAAGGG ATCTGTTACT TCTA 844 



50 



2) INFORMATION FOR SEQ ID NO: 390 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Tatumella. ptyseos 
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(B) STRAIN: ATCC 33301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390 

TTCCCTCAGG ACGCTGTACC ACAGGTGTAC AACGCTCTTG AGGTTGAAAA 5 

TGGTGATACC CGTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 10 

TCGTTCGTAC GATTGCAATG GGAACCTCTG ACGGCCTGAA ACGTGGCCTT 150 

AAGGTGACCG ATCTGCAAAA ACCGATTCAG GTACCGGTCG GTAAAGCGAC 2 00 

GCTGGGCCGT ATCATGAACG TACTGGGTCA GCCAATCGAT ATGAAAGGCG 250 

ACCTGAAGAA CGAAGATGGT AGCAATGTTG AGGTGAACTC TATTCACCGT 3 00 

GCAGCGCCAA GCTACGAAGA ACTGGCTAAC TCTACTGAGC TGCTGGAAAC 350 

GGGTATCAAG GTTATCGACC TGATCTGTCC GTTTGCAAAA GGCGGTAAAG 4 00 

TGGGTCTGTT CGGTGGTGCG GGTGTAGGTA AGACCGTCAA CATGATGGAA 450 

CTGATCCGTA ACATCGCTAT CGAGCACTCT GGTTACTCTG TATTTGCAGG 500 

GGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGACCG 550 

AGTCTAACGT TCTGGATAAA GTTGCTCTGG TTTATGGCCA GATGAACGAG 600 

CCACCAGGAA ACCGTCTGCG CGTTGCGCTG ACCGGTCTGA CTATGGCTGA 650 

AAAATTCCGT GACGAAGGCC GTGACGTACT GCTGTTCGTT GATAACATCT 700 

ATCGTTATAC CCTGGCCGGT ACTGAAGTTT CAGCACTGCT GGGTCGTATG 750 

CCTTCTGCGG TAGGTTATCA GCCAACACTG GCCGAAGAAA TGGGTGTTCT 800 

TCAGGAACGT ATCACGTCAA CCAAAACCGG TTCAATCACT TCCGTA 896 



2) INFORMATION FOR SEQ ID NO: 391 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trabulsiella guamensis 

(B) STRAIN: ATCC 49490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391 

TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTTATGAA 50 

TGGTAGTGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGTGGTGGTA 100 

TCGTACGTAC TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ATCTCGAGCA TCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCGATCGAT ATGAAAGGCG 250 

ACATCGGCGA AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCGTCCTAC 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTGAT TCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG 550 

ACAAAGTATC CCTGGTGTAT GGACAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG TCTGACCATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTACCGT TACACCCTGG 700 

CGGGTACTGA AGTATCTGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CCCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC 8 00 

CTCAACCAAA ACCGGTTCTA TCACCTCCG 829 
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2) INFORMATION FOR SEQ ID NO: 392 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia bercovieri 

(B) STRAIN: ATCC 43 970 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392 

CGAATTCCCC CAAGACGCTG TACCAAAAGT GTACAACGCC CTTGAGGTTG 50 

AAGGCACAGC TCAGAAGCTG GTGCTGGAAG TTCAGCAACA GCTGGGCGGT 100 

GGTGTTGTTC GTTGTATCGC AATGGGCTCT TCCGATGGTC TGAGCCGCGG 150 

20 GTTGAAAGTC ATCAACCTGG AACACCCAAT TGAAGTGCCG GTGGGTAAAT 2 00 

CAACTCTGGG CCGTATCATG AACGTATTGG GTGACCCAAT CGACATGAAA 2 50 

GGTCCTATCG GTGAAGAAGA GCGTTGGGCA ATCCACCGCG AAGCGCCTTC 3 00 

TTACGAAGAG CTTGCCAGCT CGCAAGATCT GTTAGAAACC GGTATCAAGG 350 

TAATGGATCT GATTTGTCCG TTCGCTAAGG GCGGTAAAGT CGGTCTGTTC 400 

25 GGTGGTGCGG GTGTGGGTAA AACAGTCAAC ATGATGGAGC TGATTCGTAA 450 

TATTGCGATT GAGCACTCAG GTTATTCTGT ATTTGCCGGT GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AGATGACTGA CTCCAACGTT 550 

CTGGACAAAG TATCCTTGGT TTATGGCCAG ATGAATGAGC CACCAGGTAA 600 

CCGTCTGCGC GTTGCACTGA CCGGCTTGAC CATGGCGGAG AAATTCCGTG 650 

30 ATGAAGGTCG TGATGTACTG TTATTCATCG ATAACATCTA TCGTTATACC 700 

CTGGCCGGTA CAGAGGTATC TGCACTGCTA GGTCGTATGC CATCAGCGGT 750 

AGGCTATCAG CCAACGCTGG CAGAAGAGAT GGGTGTGTTG CAGGAACGTA 8 00 

TCACTTCCAC CAAGACGGGT TCAATCACCT CCGTA 835 



35 



50 



2) INFORMATION FOR SEQ ID NO: 3 93 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393 



GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTTGAAGGCG CAGCTGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AACAGCTGGG CGGTGGTGTT GTTCGTTGTA 100 

55 TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCTGTGGGC AAGTCAACTC TGGGCCGTAT 2 00 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGCGAAG 250 

AAGAGCGTTG GGCAATCCAT CGTGAAGCGC CTTCTTACGA AGATCTTGCC 300 

AGCTCGCAAG ACTTGTTAGA AACCGGTATC AAGGTAATGG ACTTGATTTG 350 

60 TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 
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GTAAAACGGT AAACATGATG GAGCTTATTC 
TCAGGTTATT CCGTATTTGC TGGCGTGGGT 
CGACTTCTAC CACGAGATGA CTGACTCCAA 
TGGTTTATGG CCAAATGAAT GAGCCACCAG 
5 CTGACCGGCT TGACCATGGC GGAGAAATTC 
ATTGCTGTTC ATCGATAACA TCTATCGCTA 
TTTCCGCACT GCTGGGTCGT ATGCCATCTG 
CTGGCAGAAG AGATGGGTGT GTTGCAGGAA 
GGGTTCAATC AC 



PCT/CAOO/01150 

GTAACATTGC GATTGAGCAC 450 

GAGCGTACTC GTGAGGGTAA 500 

CGTTCTGGAC AAAGTATCCT 550 

GTAACCGTCT GCGCGTTGCA 600 

CGTGATGAAG GTCGTGACGT 650 

TACCTTAGCC GGTACGGAAG 700 

CCGTAGGTTA CCAGCCAACG 750 

CGTATTACTT CCACCAAGAC 800 

812 



2) INFORMATION FOR SEQ ID NO: 394 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 
25 (B) STRAIN: ATCC 33641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394 

AAAGTGTACA ACGCCCTTGA GGTTGAAGGT ACTGCTGAGA AGTTAGTACT 50 

30 GGAAGTTCAG CAACAGCTGG GCGGTGGTGT TGCTCGTTGT ATCGCCATGG 100 

GCTCTTCCGA TGGTTTGAGC CGCGGGTTGA AAGTTGTCAA CCTGGAACAC 150 

CCAATTGAAG TACCGGTTGG TAAATCAACT CTGGGCCGTA TCATGAACGT 200 

ATTGGGTGAC CCAATCGACA TGAAAGGTCC TATCGGTGAA GAAGAGCGTT 250 

GGGCAATCCA CCGCGAAGCG CCTTCTTACG AAGAGCTTGC CAGCTCGCAA 3 00 

35 GATCTGTTAG AAACCGGTAT CAAGGTAATG GATCTGATTT GCCCGTTCGC 350 

TAAAGGCGGT AAAGTCGGTC TGTTCGGTGG TGCGGGTGTA GGTAAAACGG 400 

TAAACATGAT GGAGCTGATC CGTAATATCG CGATCGAGCA CTCAGGTTAT 450 

TCCGTATTTG CGGGTGTTGG TGAACGTACC CGTGAGGGTA ACGACTTCTA 500 

CCACGAGATG ACTGACTCCA ACGTTCTGGA CAAAGTATCC TTGGTTTATG 550 

4 0 GCCAGATGAA TGAGCCACCA GGTAACCGTC TTCGCGTTGC ACTGACCGGT 600 

CTGACCATGG CGGAGAAATT CCGTGATGAA GGTCGTGACG TATTGCTGTT 650 

CATCGATAAC ATCTATCGTT ATACCTTGGC CGGTACGGAA GTATCCGCAC 700 

TGCTGGGTCG TATGCCATCT GCGGTAGGCT ATCAGCCAAC GCTGGCAGAA 750 

GAGATGGGTG TGTTGCAGGA ACGTATTACT TCCACCAAGA CGGGTTCAAT 800 

45 CA 802 



50 



2) INFORMATION FOR SEQ ID NO: 395 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Yersinia intermedia 

236 
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(B) STRAIN: ATCC 29909 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 95 

5 GCTGTACCAA GAGTGTACAA CGCCCTTGAG GTTGAAGGCA CTGCTGAGAA 5 0 

GCTGGTGCTG GAAGTTCAGC AACAGCTAGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GCGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCGGTTGGT AAATCAACTC TGGGCCGTAT 200 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 250 

10 AAGAGCGTTG GGCAATCCAC CGCGAAGCGC CTTCTTACGA AGAGCTTGCC 300 

AGCTCACAAG ATTTGTTAGA AACCGGTATC AAAGTAATGG ACTTGATTTG 350 

CCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACAGT AAACATGATG GAGCTTATTC GTAACATCGC GATTGAGCAC 450 

TCAGGTTATT CTGTATTTGC TGGTGTGGGT GAGCGTACTC GTGAGGGTAA 50 0 

15 CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550 

TGGTGTATGG CCAGATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGTTGTTC ATCGATAACA TCTATCGCTA TACCTTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCAACG 750 

2 0 CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACGT CCACCAAGAC 800 

GGGTTC 8 06 



25 2) INFORMATION FOR SEQ ID NO: 396 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396 



GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTAGAAGGCA CAACTGAAAA 50 

GTTAGTGCTG GAAGTTCAGC AACAGTTGGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTAACCAAC 150 

CTGGAACACC CGATCGAAGT ACCGGTTGGT AAAGCGACCC TTGGCCGTAT 200 

45 CATGAACGTA TTGGGTGAAC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 2 50 

AAGAGCGTTG GGCAATCCAT CGCGAAGCGC CTTCTTATGA AGAGCTTGCT 3 00 

AGCTCACAAG ATCTGTTAGA AACCGGTATC AAGGTTATGG ACCTGATTTG 3 50 

TCCGTTTGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

50 TCTGGGTATT CTGTATTTGC CGGTGTAGGT GAGCGTACCC GTGAGGGTAA 500 

TGACTTCTAC CATGAAATGA CTGACTCCAA CGTTTTGGAC AAAGTATCCT 550 

TGGTTTACGG CCAGATGAAT GAGCCACCAG GTAACCGTCT ACGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGCTGTTC ATCGATAATA TCTATCGTTA TACCCTAGCT GGTACGGAAG 700 

55 TATCCGCATT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA TCAGCCAACA 750 

CTGGCTGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACTAAGAC 800 

GGGTTC 806 



60 
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2) INFORMATION FOR SEQ ID NO: 3 97 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 433 80 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 97 

TTCCCCCAAG ACGCTGTACC AAAAGTGTAC AACGCCCTTG AGGTTGAAGG 50 

TGCAGCTGAG AAGCTTGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TTGTTCGTTG TATCGCAATG GGCTCTTCCG ATGGTTTGAG CCGTGGGTTG 150 

20 AAAGTTATCA ACCTGGAACA CCCAATTGAA GTGCCAGTTG GTAAAT CAAC 200 

TCTGGGCCGT ATCATGAACG TATTGGGTGA CCCAATCGAC ATGAAAGGCC 2 50 

CTATCGGTGA AGAAGAGCGT TGGGCAATCC ACCGTGAAGC GCCTTCTTAC 3 00 

GAAGAGCTTG CCAGCTCGCA AGATCTGTTA GAAACCGGTA TCAAGGTAAT 350 

GGATCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

25 GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAGCTTAT TCGTAACATC 450 

GCGATTGAGC ACTCAGGTTA TTCTGTATTT GCCGGGGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAGAT GACTGACTCC AACGTTCTGG 550 

ACAAAGTATC CTTGGTTTAT GGCCAGATGA ATGAGC CACC AGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CTTGACCATG GCGGAAAAAT TCCGTGATGA 650 

30 AGGCCGTGAC GTATTGCTGT TCATCGATAA CATTTATCGT TATACCCTAG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCATC TGCGGTAGGC 750 

TATCAGCCAA CACTGGCAGA AGAGATGGGT GTGTTGCAGG AACGTATTAC 800 

TTCCACTAAG ACGGGTTCAA TCACCTCCG ' 829 



35 



50 



2) INFORMATION FOR SEQ ID NO: 398 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398 



ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAAAA TGGTAACGAG 50 

AAACTGGTGC TGGAAGCTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

55 TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC CCTGGGCCGT 200 

ATCATGAACG TCCTGGGTCA GCCGATCGAC ATGAAAGGCG ACATCGGTGA 2 50 

AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCTTCCTAT GAAGAGCTGT 300 

CCAGCTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTAAT GGATCTGATC 350 

60 TGCCCGTTCG CTAAGGGTGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 400 
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AGGTAAAACT GTAAACATGA TGGAGCTTAT 
ACTCCGGTTA CTCTGTGTTT GCAGGCGTGG 
AACGACTTCT ACCACGAAAT GACCGACTCC 
CCTGGTGTAC GGCCAGATGA ACGAGCCGCC 
5 CGCTGACCGG CCTGACCATG GCTGAGAAAT 
GTTCTGCTGT TCGTCGATAA CATCTACCGT 
AGTATCCGCA CTGCTGGGTC GTATGCCTTC 
CTCTGGCGGA AGAGATGGGT GTTCTTCAGG 
ACCGGTTCTA TCACCTCCG 



PCT/CA00/O115O 

CCGTAACATC GCGATTGAGC ' 4^0' 

GTGAACGTAC TCGTGAGGGT 500 

AACGTACTGG ATAAAGTATC 550 

GGGAAACCGT CTGCGCGTTG 600 

TCCGTGACGA AGGCCGTGAC 650 

TATACCCTGG CCGGTACGGA 700 

TGCGGTAGGT TATCAGCCAA 750 

AACGTATCAC CTCTACCAAA 8 00 

819 



2) INFORMATION FOR SEQ ID NO: 3 99 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yarrowia lipolytica 
25 (B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399 

AAGCTTAAGG CTGAGCGAGA GCGAGGTATC ACCATTGATA TCGCTCTCTG 50 

GAAGTTCCAG ACCCCTAAGT ACTACGTCAC CGTTATTGAT GCTCCCGGTC 100 

ACCGAGATTT CATCAAGAAC ATGATTACCG GTACTTCCCA GGCTGACTGC 150 

GCCATCCTCA TCATTGCTGG TGGTGTTGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGT CAGACCCGAG AGCACGCCCT GCTCGCTTTC ACCCTCGGTG 2 50 

TTAAGCAGCT CATTGTTGCT ATCAACAAGA TGGACTCCGT CAAGTGGTCT 3 00 

CAGGATCGAT ACCTCGAGAT TTGCAAGGAG ACTGCCAACT TCGTCAAGAA 3 50 

GGTCGGTTAC AACCCCAAGG CTGTCCCCTT CGTCCCCATT TCCGGATGGA 40 0 

ACGGTGACAA CATGATCGAG CCCTCTACCA ACTGTGACTG GTACAAGGGA 4 50 

TGGACCAAGG AGACCAAGGC CGGCGAGATC AAGGGTAAGA CCCTCCTCGA 500 

GGCCATTGAT GCCATTGAGC CCCCCGTGCG ACCCCACGAC AAGCCCCTCC 550 

GACTTCCCCT CCAGGATGTC TACAAGATCG GTGGTATCGG CACAGTGCCC 600 

GTTGGCCGAG TCGAGACCGG TGTTATCAAG GCCGGTATGG KTGTTACCTT 650 

CGCTCCCGCC AACGTGACCA CTGAGGTCAA GTCTGTCGAG ATGCACCACG 70 0 

AGATCCTCCC CGATGGAGGT TTCCCCGGTG ACAACGTCGG TTTCAACGTC 750 

AAGAACGTTT CCGTCAAGGA TATCCGACGA GGTAACGTTG CTGGTGACTC 800 

CAAGAACGAC CCCCCCAAGG GCTGCGACTC TTTCAACGCT CAGGTCATTG 850 

TTCTTAACCA CCCCGGTCAG ATCGGTGCTG GTTACGCTCC CGTCCTTGAT 900 

TGCCACACTG CCCAGATTGC TTGCAAGTTC GACACCCTGA TCGAGAAGAT 950 

CGACCGACGA ACCGGTAAGA AGATGGAGGA CTCCCCCAAG TTCATCAAGT 1000 

CTGGTGATGC TGCCATTGTC AAGATGGTTC CCTCCAAGCC CATGTGTGTC * 1050 

GAGGCCTTCA CTGAGTACCC CCCTCTTGGT CGATTCGCCG TCCGAGA 10 97 



30 



35 



40 



45 
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2) INFORMATION FOR SEQ ID NO: 4 00 

55 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1233 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 4 6775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400 

10 CAAGCTTAAG GCTGAACGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT 50 

GGAAGTTCGA GACTCCCAAG TACCACGTTA CCGTCATTGA TGCCCCTGGC 100 

CATCGTGATT TCATCAAGAA CATGATTACT GGTACTTCCC AAGCTGACTG 150 

CGGTATCTTG ATTATTGCTG CTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

CCAAGGATGG TCAAACCCGT GAACACGCTT TGCTTGCTTT CACCCTTGGT 250 

15 GTCCGTCAAT TGATTGTCGC TATCAACAAG ATGGATTCCA CCAAGTACTC 300 

TGAGGCCCGT TACAACGAAA TTGTCAAGGA AGTCTCCACC TTCATCAAGA 350 

AGATTGGTTT CAACCCCAAG TCCGTTCCTT TCGTCCCTAT CTCTGGCTGG 400 

AACGGTGACA ACATGTTGGA GGARTCCACC AACATGCCTT GGTTCAAGGG 450 

ATGGAACAAG GAGACTAAGG CTGGTGCCAA GACYGGCAAG ACCCTTCTTG 500 

20 AAGCCATTGA CAACATTGAT CCCCCTGTTC GTCCTTCCGA CAAGCCCCTT 550 

CGTCTTCCCC TTCAAGATGT CTACAAGATC GGTGGTATTG GTACAGTTCC 600 

TGTCGGTCGT GTTGAGACTG GTGTCATCAA GCCTGGTATG GTTGTCACCT 650 

TCGCTCCCGC TAACGTCACC ACTGAAGTCA AGTCCGTYGA AATGCACCAC 700 

GAGCAACTTG CTGAAGGTGT TCCCGGTGAC AACGTCGGTT TCAACGTCAA 750 

25 GAACGTTTCC GTCAAGGATA TCCGCCGTGG TAACGTYTGC TCTGACTCCA 800 

AGAACGACCC CGCCAAGGAA TCCGCTTCCT TCACCGCTCA AGTTATTGTC 850 

TTGAACCACC CTGGTCARAT TGGTGCTGGT TACTCTCCTG TCTTGGATTG 900 

CCACACTGCT CACATTGCAT GCAAGTTCTY TKAGCTTCTT KAGAAGATCG 950 

ATYGTCGTTC CGGTAAGTAA ATANTTTGGT TTRGGATATG GGTATTGGGC 100 0 

3 0 TTAATCTYTG GATTTTGCCT CAATTGCTCC TTCCTTGATC TTTCTCGATT 105 0 

ACTTTTTGAT CATTTGCTAA TCCAAACCCT TTCCATTTYA TTGAAAACAG 1100 

GTAAGAAGTT GGAAGACTCC CCCAAGTTCG TCAAGWSYGG TGACTCTGCY 1150 

ATCGTCAAGA TGGTTCCTTC CAAGCCCATG TGCGTTGAAG CCTACACTGA 12 00 

ATATCCTCCT CTTGGTCGTT TCGCTGTCCG TGA 1233 

35 



2) INFORMATION FOR SEQ ID NO: 4 01 

40 ( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 
50 (B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 

CAAGTTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTCT 50 

55 GGAAGTTCGA GACTCCCAAG GTTAGTACCC CTCTGCCTAC TACATCAAGT 100 

TCTTTACAAT GCTAACATGT TGTACTCAGT ACTATGTCAC CGTCATTGAC 150 

GCCCCCGGTC ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA 200 

GGCCGACTGC GCTATTCTCA TCATTGCCGC CGGTACTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACTCGTG AGCACGCTCT CCTCGCTTAC 300 

60 ACCCTCGGTG TCAAGCAGCT CATCGTTGCC ATCAACAAGA TGGACACCAC 3 50 

240 
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CAAGTGGTCC GAGGAGCGTT ACCAGGAGAT 
TCATCAAGAA GGTCGGCTAC AACCCCAAGC 
TCCGGTTTCA ACGGTGACAA CATGATTGAG 
GTACAAGGGT TGGGAGAAGG AGACCAAGGC 
5 TCCTCGAGGC CATCGACGCC ATCGACCCTY 
CCCCTCCGTC TTCCCCTYCA GGATGTTTAC 
GGTGCCCGTC GGTCGTGTCG AGACCGGTAT 
TCACCTTCGC CCCCGCTGGT GTCACCACTG 
CACCACGAGC AGCTCACCGA GGGTGTCCCC 

10 CGTCAAGAAC GTCTCCGTCA AGGAGATCCG 
ACTCCAAGAA CGACCCCCCC AAGGGTGCCG 
ATCGTCCTCA ACCACCCTGG TCAGGTCGGT 
. CGACTGCCAC ACCGCCCACA TTGCTTGCAA 
AGATTGACCG CCGTACCGGA AAGTCTGTTG 

15 AAGTC CGGTG ACGCCGCCAT CGTCAAGATG 
CGTTGAGGCT TTCACTGACT ACCCTCCTCT 
A 



PCT/CA00/01150 

CATCAAGGAG ACCTCCAACT' 4~00 

ACGTTCCCTT CGTCCCCATC 450 

GCCTCATCCA ACTGCCCCTG 500 

CAAGGCCACT GGTAAGACCC 550 

CCAGCCGTCC CACCGACAAG 600 

AAGATTGGTG GTATTGGCAC 650 

CATCAAGGCC GGTATGGTCG 700 

AAGTCAAGTC CGTCGAGATG 750 

GGTGACAACG TCGGCTTCAA 80 0 

TCGTGGTAAC GTTGCCGGTG 850 

AGTCCTTCAA CGCCCAGGTC 900 

GCTGGTTACG CCCCAGTCCT 950 

GTTCTCTGAG CTCCTCGAGA 1000 

AGAACTCTCC CAAGTTCATC 1050 

GTTCCCTCCA AGCCCATGTG 1100 

CGGTCGTTTC GCTGTCCGTG 1150 

1151 



20 



2) INFORMATION FOR SEQ ID NO: 4 02 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1283 bases 
25 (B) TYPE: Nucleic acid' 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 

CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCCA GACCTCCAAG TATGAGGTCA CCGTCATTGG TAAGCATTTG 10 0 

AGTTCCAACC TACGTTGCCC AACATTTACA GTCATCTAAC AAAGTTCAAT 150 

40 AGATGCCCCC GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACTT 2 00 

CCCAGGCTGA CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAATTC 250 

GAGGCTGGTA TCTCCAAGGA TGGTCAGACC CGTGAGCACG CTCTGCTCGC 3 00 

TTTCACCCTC GGTGTCCGTC AGCTCATCGT TGCCCTCAAC AAGATGGACA 350 

CCTGCAAGTG GTCTCAGGAT CGTTACAACG AAATCGTTAA GGAGACTTCC 400 

45 AACTTCATCA AGAAGGTCGG ATACAACCCC AAGAGCGTTC CTTTCGTCCC 450 

CATCTCCGGT TTCAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC 500 

CCTGGTACAA GGGTTGGGAG AAGGAGACCA AGGCTGGCAA GTCCACCGGT 550 

AAGACCCTTC TCGAGGCCAT CGATGC CATC GAGCCCCCCG TCCGTCCCAC 600 

CGACAAGCCT CTCCGTCTTC CCCTYCAGGA TGTCTACAAG ATCTCTGGTA 650 

50 TCGGTACTGT GCCCGTCGGT CGTGTCGAGA CTGGTGTCAT CAAGCCTGGT 700 

ATGGTCGTTA CTTTCGCTCC TGCCAACGTG ACCACTGAAG TCAAGTCCGT 750 

TGAAATGCAC CACCAGCAGC TCCAGGCCGG TAACCCCGGT GACAACGTTG 800 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AAGTCCGCCG TGGTAACGTT 850 

GCCGGTGACT CCAAGAACGA CCCCCCTGCT GGCTGCGATT CCTTCAACGC 900 

55 CCAGGTCATC GTCCTTAACC ACCCCGGTCA GGTCGGCAAC GGTTACGCTC 950 

CCGTCCTGGA CTGCCACACC GCTCACATTG CTTGCAAGTT CGCTGAGCTC 1000 

CTTGAGAAGA TTGACCGCCG TACCGGTAAA TCTGTTGAGG ACAAGCCCAA 1050 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATT CCCTCCAAGC 1100 

CCATGTGTGT GGAGTCTTTC ACTGACTTCC CCCCTCTTGG TCGTTTCGCT 1150 

60 GTCCGTGACG TAAGTTTTTC CCTCTTGACT ATCTTCACAA TTTTTCACAT 1200 

241 
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ATTTTCACGC CTCGTCCCAC TCTTTTTCCT CCCTTCCTCT TTGGTTCCCC 1250 
TTTTTGCCTG CAAGTTCTCT ATAGCTAACA TGA 1283 



2 ) INFORMATION FOR SEQ ID NO: 403 

( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 1103 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatzus 

(B) STRAIN: DAL95 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 403 

TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT GGAAGTTCCA 50 

GACTCCCAAG TATGAGGTCA CTGTCATCGG TAAGCTCGAC TCGCCCCGAT 100 

ATGTTTTGGT GCTGTAGCTA ACACGATCTG AAGATGCCCC CGGTCACCGT 150 

25 GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG ACTGCGCTAT 200 

CCTCATCATT GCCTCCGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG 250 

ATGGCCAGAC CCGTGAGCAC GCTCTGCTGG CTTTCACCCT CGGTGTCAAG 300 

CAGCTCATCG TCGCCCTCAA CAAGATGGAC ACCTGCAAGT GGTCCGAGGA 350 

TCGTTACAAC GAAATTGTCA AGGAAACCTC CAACTTCATC AAGAAGGTCG 400 

3 0 GCTACAACCC CAAGGCCGTT CCCTTCGTCC CCATCTCTGG CTTCAACGGT 450 

GACAACATGC TTGAGCCCTC CTCCAACTGC CCCTGGTACA AGGGATGGGA 500 

GAAGGAGACC AAGGCCGGCA AGGTCACTGG TAAGACCCTC ATCGAGGCCA 550 

TCGACGCCAT TGAGCCCCCT GTCCGTCCCT CCAACAAGCC CCTCCGTCTT 600 

CCCCTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACGG TCCCTGTCGG 650 

35 CCGTGTCGAG ACCGGTATCA TCAAGCCCGG CATGGTCGTC ACCTTCGCCC 700 

CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACCAGCAG 750 

CTCCAGGAGG GTGTCCCCGG TGACAACGTC GGTTTCAACG TCAAGAACGT 800 

TTCCGTCAAG GAAGTCCGCC GTGGTAACGT CTGCGGTGAC TCCAAGAACG 850 

ATCCCCCTCA GGGTGCTGCC TCCTTCAACG CCCAGGTCAT CGTCCTCAAC 900 

40 CACCCCGGTC AGGTCGGCGC TGGTTACGCC CCCGTCCTCG ACTGCCACAC 950 

TGCCCACATT GCTTGCAAGT TCTCTGAGCT GCTTGAGAAG ATTGACCGCC 1000 

GTACCGGCAA GTCTGTTGAG AACAACCCCA AGTTCATCAA GTCCGGTGAT 1050 

GCCGCCATCG TGAAGATGGT TCCTTCCAAG CCCATGTGTG TCGAGTCCTT 1100 

CAC 1103 

45 



2) INFORMATION FOR SEQ ID NO: 4 04 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55- 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
60 (B) STRAIN: WSA-172 

242 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404 



AAGCTCAAGT 
5 GAAGTTCCAG 
CGCCCCGATA 
GGTCACCGTG 
CTGCGCTATC 
TCTCCAAGGA 

10 GGTGTCAAGC 
GTCCGAGGAT 
AGAAGGTCGG 
TTCAACGGTG 
GGGATGGGAG 

15 TCGAGGCCAT 
CTCCGTCTTC 
CCCTGTCGGC 
CCTTCGCCCC 
CACCAGCAGC 

2 0 CAAGAACGTT 
CCAAGAACGA 
GTCCTCAACC 
CTGCCACACT 
TTGACCGCCG 

25 TCCGGTGATG 
CGAGTCCTTC 



CCGAGCGTGA 
ACTCCCAAGT 
TGTTTTGGTG 
ACTTCATCAA 
CTCATCATTG 
TGGCCAGACC 
AGCTCATCGT 
CGTTACAACG 
CTACAACCCC 
ACAACATGCT 
AAGGAGACCA 
CGACGCCATT 
CCCTCCAGGA 
CGTGTCGAGA 
CGCCAACGTC 
TCCAGGAGGG 
TCCGTCAAGG 
TCCCCCTCAG 
ACCCCGGTCA 
GCCCACATTG 
TACCGGCAAG 
CCGCCATCGT 
ACTGACTACC 



GCGTGGTATC 
ATGAGGTCAC 
CTGTAGCTAA 
GAACATGATC 
CCTCCGGTAC 
CGTGAGCACG 
CGCCCTCAAC 
AAATTGTCAA 
AAGGCCGTTC 
TGAGCCCTCC 
AGGCCGGCAA 
GAGCCCCCTG 
TGTCTACAAG 
CCGGTATCAT 
ACCACTGAAG 
TGTCCCCGGT 
AAGTCCGCCG 
GGTGCTGCCT 
GGTCGGCGCT 
CTTGCAAGTT 
TCTGTTGAGA 
GAAGATGGTT 
CCCCTCTGGG 



ACCATCGACA 
TGTCATCGGT 
CACGATCTGA 
ACTGGTACCT 
TGGTGAGTTC 
CTCTGCTGGC 
AAGATGGACA 
GGAAACCTCC 
CCTTCGTCCC 
TCCAACTGCC 
GGTCACTGGT 
TCCGTCCCTC 
ATCTCTGGTA 
CAAGCCCGGC 
TCAAGTCCGT 
GACAACGTCG 
TGGTAACGTC 
CCTTCAACGC 
GGTTACGCCC 
CTCTGAGCTG 
ACAACCCCAA 
CCTTCCAAGC 
TCGTTTCGCC 



TTGCCCTCTG 50 

AAGCTCGACT 100 

AGATGCCCCC 150 

CCCAGGCTGA 200 

GAGGCTGGTA 250 

TTTCACCCTC 3 00 

CCTGCAAGTG 350 

AACTTCATCA 400 

CATCTCTGGC 45 0 

CCTGGTACAA 500 

AAGACCCTCA 550 

CAACAAGCCC 60 0 

TCGGAACGGT 650 

ATGGTCGTCA 700 

CGAGATGCAC 750 

GTTTCAACGT 800 

TGCGGTGACT 850 

CCAGGTCATC 900 

CCGTCCTCGA 950 

CTTGAGAAGA 1000 

GTTCATCAAG 1050 

CCATGTGTGT 1100 

GTCCGTGAC 114 9 



30 2 ) INFORMATION FOR SEQ ID NO: 405 



35 



40 



45 



50 



55 



60 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) S TRANDEDNE S S : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 



CAAGCTCAAG 
GGAAGTTCCA 
CAGAGTTCTC 
CCCGGTCACC 
TGACTGCGCT 
GTATCTCCAA 
CTCGGTGTCC 
GTGGTCCGAG 
TCAAGAAGGT 
GGTTTCAACG 
CAAGGGTTGG 
TCCTTGAGGC 
CCCCTCCGTC 
TGTTCCCGTC 
TGACCTTCGC 



TCCGAGCGTG 
GACTGGCAAG 
TTTTCATCAA 
GTGACTTCAT 
ATCCTCATCA 
GGATGGCCAG 
GCCAGCTCAT 
GACCGTTACA 
CGGATACAAC 
GTGACAACAT 
GAGAAGGAGA 
CATCGACGCC 
TTCCCCTCCA 
GGTCGTGTCG 
TCCCGCCAAC 



AGCGTGGTAT 
TATGAGGTCA 
AGCAATATAC 
CAAGAACATG 
TTGCCTCCGG 
ACTCGTGAGC 
CGTTGCCCTC 
ACGAAATCGT 
CCCAAGGGTG 
GCTCGAGCCC 
CCAAGGCCGG 
ATCGAGCCCC 
GGATGTCTAC 
AGACCGGTAT 
GTCACCACTG 
24 



CACCATCGAC 
CCGTCATTGG 
TAACGTCCAT 
ATCACTGGTA 
TACTGGTGAG 
ACGCTCTGCT 
AACAAGATGG 
TAAGG AG AC C 
TTCCTTTCGT 
TCCCCCAACT 
CAAGGTCACC 
CCGTCCGTCC 
AAGATCTCCG 
CATTGCCCCT 
AAGTCAAGTC 
3 



ATTGCCCTCT 50 

TATGTACTCA 100 

CATAGACGCC 150 

CCTCCCAGGC 2 00 

TTCGAGGCTG 250 

TGCTTTCACC 300 

ACACCTGCAA 350 

TCCAACTTCA 4 00 

CCCCATCTCC 450 

GCCCCTGGTA 500 

GGTAAGACCC 550 

CTCCAACAAG 600 

GTATTGGAAC 650 

GGTATGGTCG 700 

CGTTGAGATG 750 
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CACCACCAGC AGCTCAAGGA AGGTGTCCCC 
CGTCAAGAAC GTTTCCGTCA AGGAGGTTCG 
ACTCCAAGAA CGACCCCCCT CTTGGCTGTG 
ATCGTCCTCA ACCACCCCGG TCAGGTCGGC 
5 GGACTGCCAC ACTGCTCACA TTGCTTGCAA 
AGATTGACCG CCGTACCGGA AAGTCTGTTG 
AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 
TGTTGAGGCT TTCACTGACT ACCCCCCTCT 
A 



PCT/CA00/01150 

GGTGACAACG TTGGTTTCAA 80 0 

CCGTGGTAAC GTTGCCGGTG 850 

AGAGCTTCAC CGCCCAGGTC 90 0 

GCTGGTTACG CTCCCGTCCT 95 0 

GTTCGCTGAG CTCCTTGAGA 1000 

AATCTTCCCC CAAGTTCATC 1050 

ATTCCCTCCA AGCCCATGTG 1100 

TGGTCGTTTC GCCGTCCGCG 1150 

1151 



2) INFORMATION FOR SEQ ID NO: 406 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capita tus 
25 (B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406 

GCTTAAAGCT GAACGTGAAC GTGGTATCAC CATTGATATC GCTCTCTGGA 50 

30 AGTTCGAAAC TCCTAAGTAC TACGTTACTG TTATTGATGC TCCAGGTCAC 10 0 

CGTGATTTCA TCAAGAACAT GATTACTGGT ACTTCCCAAG CCGATTGCGC 15 0 

CATTCTTATC ATTGCTGCCG GTGTCGGTGA ATTCGAAGCT GGTATCTCCA 200 

AGGAAGGTCA AACCAGAGAA CACGCTCTTC TCGCTTTCAC CCTTGGTGTC 250 

AGACAACTTA TCATTGCCAT CAACAAGATG GACTCTGTCA AGTGGGACCA 300 

35 AAAGAGATAC GAAGAAATCG TCAAGGAGGC TTCCAACTTC GTCAAGAAGG 350 

TTGGTTACAA CCCCAAGTCT GTTCCATTCG TTCCTATCTC TGGTTGGAAC 400 

GGTGACAACA TGTTGGAACC TACCACCAAC GCCCCATGGT ACAAGGGATG 450 

GACCAAGGAA ACCAAGGCTG GTGCCACTAA GGGTATGACT CTTATTGAAG 500 

CCATTGACGC CATTGAACCA CCAGTAAGAC CATCCGACAA GCCACTCCGT 550 

40 CTCCCACTCC AAGATGTTTA CAAGATTGGT GGTATCGGAA CTGTGCCAGT 600 

CGGCCGTGTC GAAACCGGTA TCATCAAGGC CGGTATGGTC GTTACCTTTG 650 

CTCCACCAAT GGTCACAACT GAAGTTAAGT CCGTTGAAAT GCACCACGAA 700 

CAACTTGCTC AAGGTAACCC AGGTGACAAC GTTGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTT AAGGAAATCA GACGTGGTAA CGTCTGTGGT GACTCCAAGA 800 

45 ACGATCCACC AAAGGGCTGC GAATCTTTCA ACGCTCAAGT TATCGTCTTG 850 

AACCACCCTG GTCAAATCTC TGCTGGTTAC TCTCCAGTTC TCGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA GATTCGACGA ACTCCTTGAA AAGATCGACC 950 

GTCGTTCTGG TAAGAAGATT GAAGACTCTC CAAAGTTTGT CAAGTCTGGT 1000 

GATGCCGCTA TCGTCAAGAT GATCCCAACC AAGCCAATGT GCGTTGAAAC 1050 

50 CTTCACTGAA TACCCACCAC TTGGTCGTTT CGCCGTCCGT GAT 1093 



2 ) INFORMATION FOR SEQ ID NO : 407 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

244 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 10231 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 

10 CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

15 TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTYGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTC CATGGT A 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

20 TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 60 0 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

25 CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 8 00 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 90 0 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 95 0 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 10 0 0 

30 AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 105 0 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 110 0 

A 1101 



35 



45 



2) INFORMATION FOR SEQ ID NO: 408 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1089 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 

GAAGGCTGAA AGAGAAAGAG GTATCACCAT TGATATCGCT TTGTGGAAAT 50 

TCGAAACTCC AAAATACCAC GTTACCGTCA TTGATGCTCC AGGTCACAGA 10 0 

GATTTCATCA AGAATATGAT CACTGGTACT TCTCAAGCTG ATTGTGCTAT 150 

55 TTTGATTATT GCTGGTGGTA CTGGTGAATT CGAAGCCGGT ATTTCTAAGG 200 

ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACTTT GGGTGTCAAA 250 

CAATTGATTG TTGCTGTCAA CAAGATGGAC TCTGTCAAAT GGGACAAAAA 3 00 

CAGATTTGAA GAAATCATCA AGGAAAC CTC CAACTTCGTC AAGAAGGTTG 350 

GTTACAACCC AAAGACTGTT CCATTCGTTC CAATCTCTGG TTGGAATGGT 4 00 

60 GACAACATGA TTGAACCATC CACCAACTGT CCATGGTACA AGGGTTGGGA 450 

245 
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AAAGGAAACC AAATCCGGTA AAGTTACTGG TAAGACCTTG TTAGAAGCTA 500 

TTGACGCTAT TGAACCACCA ACCAGACCAA CCGACAAACC ATTGAGATTG 550 

CCATTGCAAG ATGTTTACAA GATCGGTGGT ATTGGTACTG TGCCAGTCGG 600 

TAGAGTTGAA ACTGGTATCA TCAAAGCCGG TATGGTTGTT ACTTTCGCCC 650 

5 CAGCTGGTGT TACCACTGAA GTCAAATCCG TTGAAATGCA TCACGAACAA 700 

TTGGCTGAAG GTGTTCCAGG TGACAATGTT GGTTTCAACG TTAAGAACGT 750 

TTCCGTTAAA GAAATTAGAA GAGGTAACGT TTGTGGTGAC TCCAAGAACG 8 00 

ATCCACCAAA GGGTTGTGAC TCTTTCAATG CCCAAGT CAT TGTTTTGAAC 850 

CATCCAGGTC AAATCTCTGC TGGTTACTCT CCAGTCTTGG ATTGTCACAC 900 

10 TGCCCACATT GCTTGTAAAT TCGACACTTT GGTTGAAAAG ATTGACAGAA 950 

GAACTGGTAA GAAATTGGAA GAAAATCCAA AATTCGTCAA ATCCGGTGAT 1000 

GCTGCTATCG TCAAGATGGT CCCAACCAAA CCAATGTGTG TTGAAGCTTT 1050 

CACTGACTAC CCACCATTAG GTAGATTCGC TGTCAGAGA 1089 



15 



30 



60 



2 ) INFORMATION FOR SEQ ID NO : 4 09 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 56884 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409 



CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

35 CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

4 0 TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

45 TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

50 ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 90 0 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 10 0 0 

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

55 A 1101 



2 ) INFORMATION FOR SEQ ID NO: 410 

246 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 60193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTAC CACTG AAGTCAAATC CGTTGAAATG 7 00 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

AT H02 



2 ) INFORMATION FOR SEQ ID NO: 411 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1102 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 90028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

247 
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GTATTTCTAA GGATGGTCAA 
TTGGGTGTCA AACAATTGAT 
ATGGGACAAA AACAGATTTG 
TCAAGAAGGT TGGTTACAAC 
5 GGTTGGAATG GTGACAACAT 
CAAGGGTTGG GAAAAGGAAA 
TGTTAGAAGC TATTGACGCT 
CCATTGAGAT TGCCATTGCA 
TGTGCCAGTC GGTAGAGTTG 

10 TTACTTTCGC CCCAGCTGGT 
CATCACGAAC AATTGGCTGA 
CGTTAAGAAC GTTTCCGTTA 
ACTCCAAGAA CGATCCACCA 
ATTGTTTTGA ACCATCCAGG 

15 GGATTGTCAC ACTGCCCACA 
AGATTGACAG AAGAACTGGT 
AAATCCGGTG ATGCTGCTAT 
TGTTGAAGCT TTCACTGACT 
AT 

20 



ACCAGAGAAC ACGCTTTGTT 
TGTTGCTGTC AACAAGATGG 
AAGAAATCAT CAAGGAAACC 
CCAAAGACTG TTCCATTCGT 
GATTGAACCA TCCACCAACT 
CCAAATCCGG TAAAGTTACT 
ATTGAACCAC CAACCAGACC 
AGATGTTTAC AAGATCGGTG 
AAACTGGTAT CATCAAAGCC 
GTTACCACTG AAGTCAAATC 
AGGTGTTCCA GGTGACAATG 
AAGAAATTAG AAGAGGTAAC 
AAGGGTTGTG ACT CTTTCAA 
TCAAATCTCT GCTGGTTACT 
TTGCTTGTAA ATTCGACACT 
AAGAAATTGG AAGAAAATCC 
CGTCAAGATG GTCCCAACCA 
ACCCACCATT AGGTAGATTC 
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GGCTTACACT ' 


25<f 


ACTCTGTCAA 


300 


TCCAACTTCG 


350 


TCCAATCTCT 


400 


GTCCATGGTA 


450 


GGTAAGACCT 


500 


AACCGACAAA 


550 


GTATTGGTAC 


600 


GGTATGGTTG 


650 


CGTTGAAATG 


700 


TTGGTTTCAA 


750 


GTTTGTGGTG 


800 


TGCCCAAGTC 


850 


CTCCAGTCTT 


900 


TTGGTTGAAA 


950 


AAAATTCGTC 


1000 


AACCAATGTG 


1050 


GCTGTCAGAG 


1100 




1102 



2 ) INFORMATION FOR SEQ ID NO: 412 

25 { i ) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

. (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 
35 (B) STRAIN: NCPF 3108 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

4 0 CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 20 0 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

45 ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 4 50 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

50 CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

55 ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 90 0 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

60 TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

248 
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WO 01/23604 

A 



PCT/CA00/01150 

1101 



5 2) INFORMATION FOR SEQ ID NO: 413 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413 



GGACAAGCTT AAGGCTGAGC GTGAGAGAGG TATCACCATT GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCC AAGTACCACG TCACTGTCAT TGACGCCCCC 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACCT CGCAGGCTGA 150 

CTGTGCTATC TTGATCATTG CTTCCGGTGT CGGTGAGTTC GAGGCTGGTA 2 00 

25 TCTCCAAGGA CGGTCAGACC CGTGAGCACG CCTTGTTGGC CTACACCTTG 250 

GGTGTCAAGC AGTTGATCGT TGCCATCAAC AAGATGGACT CCGTCAAGTG 300 

GGACAAGAAC AGATTCGAGG AGATTGTCAA GGAGACCACC AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCC AAGGCTGTCC CCTTCGTCCC CATCTCTGGC 400 

TGGAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC CCTGGTACAA 450 

30 GGGCTGGGAG AAGGAGACCA AGGCCGGTAA GTCTACCGGT AAGACCTTGT 500 

TGGAGGCCAT TGACGCCATT GAGCCCCCTA CCAGACCCAC CGACAAGCCC 550 

TTGAGATTGC CCTTGCAGGA TGTCTACAAG ATCGGTGGTA TTGGTACGGT 600 

GCCCGTCGGC CGTGTCGAGA CCGGTGTCAT CAAGCCCGGT ATGGTCGTCA 650 

CCTTCGCCCC CGCTGGTGTC ACCACTGAAG TCAAGTC CGT CGAGATGCAC 700 

35 CACGAGCAGT TGTCCGAGGG TGTCCCCGGT GACAACGTTG GTTTCAACGT 750 

CAAGAACGTC TCTGTTAAGG AGATCAGACG TGGTAACGTC TGCGGTGACT 800 

CCAAGAACGA CCCCCCCATG GGTTGCTCTT CTTTCAACGC CCAGGTTATC 850 

GTGTTGAACC ACCCCGGTCA GATCTCTGCC GGTTACTCTC CCGTCTTGGA 900 

CTGCCACACC GCCCACATTG CTTGCAAGTT CGCCGAGTTG ATCGAGAAGA 950 

40 TTGACAGACG TACCGGTAAG TCCATGGAGG CTAACCCCAA GTTCGTCAAG 1000 

TCTGGTGACG CCGCCATCGT CAAGATGGAG CCCACTAAGC CCATGTGTGT 1050 

TGAGGCCTTC ACTGACTTCC CTCCTTTGGG TAGATTCGCC GTCAGAGA 1098 



45 



55 



2) INFORMATION FOR SEQ ID NO: 414 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1102 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3 94 9 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414 

249 
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10 



15 



20 



25 



CTTGGATAAA 
CTTTGTGGAA 
CCAGGTCACA 
TGATTGTGCT 
GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 
TCAAGAAGGT 
GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 
TGTGCCAGTC 
TCACTTTTGC 
CATCACGAAC 
CGTTAAGAAT 
ACTCCAAGAA 
ATTGTCTTGA 
GGATTGTCAC 
AGATTGACAG 
AAATC CGGTG 
TGTTGAAGCT 
AT 



TTGAAGGCTG 
ATTCGAAACT 
GAGATTTCAT 
ATTTTGATTA 
GGATGGTCAA 
AACAATTGAT 
AACAGATTCG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
TATTGATGCT 
TGCCATTGCA 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGTTGA 
GTTTCTGTCA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAACTGGT 
ACGCTGCTAT 
TTCACTGACT 



AAAGAGAAAG 
CCAAAATACC 
CAAGAACATG 
TTGCTGGTGG 
ACCAGAGAAC 
TGTTGCTGTC 
AAGAAATCAT 
CCAAAGACTG 
GATTGAAGCT 
CCAAATCCGG 
ATTGAACCAC 
AGATGTTTAC 
AAACTGGTGT 
GTTACCACTG 
AGGTGTTCCA 
AAGAAATTAG 
AAGGGTTGTG 
TCAAATCTCT 
TTGCTTGTAA 
AAGAAATTGG 
YGTCAAGATG 
ACCCACCATT 



AGGTATCACC 
ACGTTACCGT 
ATCACTGGTA 
TACTGGTGAA 
ACGCTTTGTT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCCACCAACT 
TAAGGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATTAAAGCC 
AAGTCAAATC 
GGTGACAATG 
AAGAGGTAAC 
ACTCTTTCAA 
GCTGGTTAYT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 



ATTGATATCG 50 

CATTGATGCT 100 

CTTCTCAAGC 150 

TTCGAAGCCG 200 

GGCTTACACT 250 

ACTCTGTCAA 300 

TCTAACTTCG 350 

TCCAATCTCT 400 

GTCCATGGTA 450 

GGTAAGACCT 500 

AACCGACAAA 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 

TTGGTTTCAA 750 

GTTTGTGGTG 800 

TGCCCAAGTC 850 

CTCCAGTCTT 900 

TTGGTTGAAA 950 

AAAATTCGTC 1000 

AACCAATGTG 1050 

GCTGTCAGAG 1100 
1102 



2) INFORMATION FOR SEQ ID NO: 415 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, diibliniensis 
40 (B) STRAIN: CBS 7987 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

45 CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

50 ATGGGACAAA AACAGATTYG AAGAAATCAT CAAGGAAACC TCTAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

55 CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

60 ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

250 
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WO 01/23604 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT 
GGATTGTCAC ACTGCCCACA TTGCTTGTAA 
AGATTGACAG AAGAACTGGT AAGAAATTGG 
AAATCCGGTG ACGCTGCTAT YGTCAAGATG 
5 TGTTGAAGCT TTCACTGACT ACCCACCATT 
AT 



PCT/CA00/01150 

GCTGGTTAYT CTCCAGTCTT 900 
ATTCGACACT TTGGTTGAAA 950 
AAGAAAATCC AAAATTCGTC 1000 
GTCCCAACCA AACCAATGTG 1050 
AGGTAGATTC GCTGTCAGAG 1100 

1102 



10 2) INFORMATION FOR SEQ ID NO: 416 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, famata 

(B) STRAIN: ATCC 62894 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416 



AATTGAAGGC TGAAAGAGAA AGAGGTATCA CCATTGATAT CGCTTTATGG 50 

AAATTCGAAA CTCCAAAATA CCACGTTACC GTTATTGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGATTGTG 150 

CTATTTTRAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCT 2 00 

3 0 AAGGATGGTC AAACCAGAGA ACACGCTTTA TTGGCTTACA CCTTAGGTGT 250 

TAGACAATTG ATTGTTGCCG TCAACAAGAT GGACTCTGTT AAATGGGACA 3 00 

AGGCTAGATT CGAAGAAATC ATCAAGGAAA CCTCTAACTT CGTCAAGAAG 350 

GTTGGTTACA ACCCTAAGAC TGTTCCTTTC GTYCCAATTT CTGGATGGAA 400 

CGGTGACAAC ATGATTGAAG CCTCCACCAA CTGTCCATGG TACAAGGGTT 450 

3 5 GGGAAAAGGA AACCAAGGCT GGTAAATCTA CTGGTAAGAC TTTGTTAGAA 500 

GCCATTGATG CCATTGAACC ACCAACCAGA CCAACCGAAA AGCCATTGAG 550 

ATTACCATTA CAAGATGTCT ACAAGATCGG TGGTATTGGT ACTGTGCCAG 600 

TCGGTAGAGT TGAAACCGGT GTTATCAAGG GTGGTATGGT TGTTACCTTT 650 

GCCCCAGCCG GTGTCACTAC CGAAGTCAAA TCCGTTGAAA TGCACCACGA 700 

40 ACAATTAGCT GAAGGTGTTC CAGGTGACAA TGTTGGTTTC AACGTCAAGA 750 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTTTGTGG TGACTCCAAG 800 

AACGACCCAC CAAAGGGTGC TGAATCTTTC ACCGCTCAAG TTATTGTCTT 850 

GAACCACCCA GGTCARATCT CTGCTGGTTA CTCTCCAGTC TTAGATTGTC 900 

ACACCGCCCA CATTGCTTGT AAATTCGATG CTTTACTCGA AAAGATTGAC 950 

45 AGAAGATCCG GTAAGAAATT AGAAGACGAA CCAAAATTCG TCAAGTCCGG 1000 

TGATGCTGCT ATCGTCAAGA TGGTCCCAAC CAAACCAATG TGTGTTGAAG 1050 

CTTTCACTGA ATACCCACCA TTAGGTAGAT TCGCTGTTAG AGAT 1094 



50 



60 



2) INFORMATION FOR SEQ ID NO: 417 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1101 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

251 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417 

TTTGGACAAG TTGAAGGCTG AAAGAGAAAG AGGTATCACT ATCGATATCG 50 

CTTTGTGGAA GTTCGAAACT CCAAAGTACC ACGTYACCGT TATCGATGCY 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

10 TGACTGTGCT ATCTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCYG 200 

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTCTATT GGCTTTCACC 250 

CTAGGTGTTA GACAATTGAT TGTYGCTGTC AACAAGATGG ACTCTGTCAA 300 

GTGGGATGAA TCCAGATTCG CTGAAATCGT TAAGGAAACC TCCAACTTCA 350 

TCAAGAAGGT CGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 4 00 

15 GGTTGGAACG GTGACAACAT GATTGAAGCC ACCACCAACG CTTCCTGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAGGCTGG TGTCGTCAAG GGTAAGACCT 500 

TGTTGGAAGC CATTGACGCT ATCGAACCAC CAACCAGACC AACTGACAAG 550 

CCATTGAGAT TGCCATTGCA AGATGTCTAC AAGATCGGTG GTATCGGTAC 600 

GGTGCCAGTC GGTAGAGTCG AAACCGGTGT CATCAAGCCA GGTATGGTTG 650 

2 0 TTACCTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCACGAAC AATTGACTGA AGGTTTGCCA GGTGACAACG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AGGAAATCAG AAGAGGTAAT GTCTGTGGTG 8 00 

ACTCCAAGAA CGACCCACCA AAGGCTGCTG CTTCTTTCAA CGCTACCGTC 850 

ATTGTCTTGA ACCACCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTTTT 900 

25 GGACTGT C AC ACCGCCCACA TTGCTTGTAA GTTCGAAGAA TTGTTGGAAA 950 

AGAACGACAG AAGATCCGGT AAGAAGTTGG AAGACTCTCC AAAGTTCTTG 1000 

AAGTCCGGTG ACGCTGCTTT GGTTAAGTTC GTTCCATCCA AGCCAATGTG 1050 

TGTCGAAGCT TTCTCCGACT ACCCACCATT GGGTAGATTC GCTGTCAGAG 1100 

A HOI 

30 



2) INFORMATION FOR SEQ ID NO: 418 

35 ( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 108 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
45 (B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418 

AGAGAAAGAG GTATCACCAT TGACATTGCT TTGTGGAAAT TCGAGACTCC 50 

50 AAAGTACCAC GTTACYGTCA TTGATGCCCC AGGTCACAGA GATTTCATCA 100 

AGAACATGAT CACTGGTACT TCTCAAGCTG ACTGTGCTAT TTTGATTATT 150 

GCTGGTGGTA CCGGTGAATT CGAAGCTGGT ATCTCTAAGG ATGGTCAAAC 200 

CAGAGAGCAC GCTTTGTTGG CTTACACCTT GGGTGTTAGA CAATTGATTG 250 

TTGCTGTCAA CAAGATGGAC TCCGTCAART GGGACAAGAA CAGATTYGAG 300 

55 GAAATCATCA AGGAAACCTC TAACTTCGTC AAGAAGGTTG GTTACAACCC 350 

TAAGACTGTG CCATTCGTTC CTATCTCTGG ATGGAAYGGT GACAACATGA 400 

TTGAGGCTTC TACCAACTGT CCTTGGTACA AGGGATGGGA GAAGGAGACC 450 

AAGGCTGGTA AGTCCACCGG TAAGACTTTG TTGGAGGCCA TTGACGCCAT 500 

TGAGCCACCT CAAAGACCAA CCGACAAGCC ATTGAGATTG CCATTGCAAG 550 

60 ATGTYTACAA GATTGGTGGT ATTGGAACGG TGCCAGTCGG TAGAGTTGAA 600 

252 
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ACCGGTATCA TYAAGGCCGG TATGGTTGTT 
YACCACTGAA GTCAAGTCCG TGGAAATGCA 
GTGTTCCAGG TGACAATGTT GGTTTCAACG 
GAAATTAGAA GAGGTAACGT TTGTGGTGAC 
5 GGGTTGTGAC TCTTTCACCG CTCAAGTTAT 
AAATCTCTGC TGGTTACTCT CCAGTTTTGG 
GCTTGTAAAT TCGACACCTT GTTGGAGAAG 
GAAGATGGAG GACAACCCCA AGTTTGTCAA 
TCAAGATGGT GCCATCCAAG CCAATGTGTG 
10 CCACCATTGG GAAGATTCGC CGTCAGAGAC 
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ACCTTTGCCC CAGCTGGTGT 650 

CCACGAACAA TTGGTTGAAG 700 

TTAAGAACGT TTCCGTTAAG 750 

TCCAAGAACG ACCCACCAAA 800 

TGTGTTGAAC CACCCTGGTC 850 

ACTGTCACAC CGCCCACATT 900 

ATTGACAGAA GAACCGGTAA 950 

GTCCGGTGAC GCTTCTATCG 100 0 

TTGAGGCTTT CACCGACTAC 105 0 

1080 



15 



2) INFORMATION FOR SEQ ID NO: 419 



( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419 

3 0 TCTGTCAAGT GGGACAAGGC CAGATACGAG GAAATCGTCA AGGAGACCTC 50 

TAACTTCGTC AAGAAGGTTG GTTACAACCC TAAGACTGTT CCATTCGTCC 100 

CAATCTCTGG TTGGAACGGT GACAACATGA TTGAGGCTTC TACCAACTGT 150 

GACTGGTACA AGGGTTGGGA GAAGGAGACC AAGTCTGGTA AGTCCACCGG 200 

TAAGACCTTG TTGGAGGCCA TTGACGCCAT TGAGCCACCA ACCAGACCAA 250 

3 5 CCGACAAGCC ATTGAGATTG CCATTGCAGG ATGTCTACAA GATTGGTGGT 3 00 

ATCGGAACTG TGCCAGTCGG CAGAGTTGAG ACCGGTGTTA TCAAGGC CGG 350 

TATGGTTGTC ACCTTCGCCC CAGCTGGTGT CACCACTGAA GTCAAGTCTG 40 0 

TCGAGATGCA CCACGAGCAG TTGCCAGAGG GTGTCCCAGG TGACAACGTT 450 

GGTTTCAACG TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT 500 

40 CTGTGGTGAC TCCAAGCAGG ACCCACCAAA GGGCTGTGAC TCTTTCACCG 550 

CTCAGGTTAT TGTGTTGAAC CACCCAGGTC AGATCTCTTC TGGTTACTCT 600 

CCAGTTTTGG ACTGTCACAC TGCCCACATT GCTTGTAAGT TCGACACCTT 650 

GGTTGAGAAG ATCGACAGAA GAACCGGTAA GAAGTTGGAA GATGAGCCAA 700 

AGTTCATCAA GTCCGGTGAC GCTGCTATCG TCAAGATGGT CCCAACCAAG 750 

45 C 751 



50 



2) INFORMATION FOR SEQ ID NO: 42 0 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Candida inconspicua 

253 
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WO 01/23604 PCT/CA00/O115O 
(B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420 

5 TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA . ATTCGAAACT CCAAAGTATC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

AGATTGTGCT ATTTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200 

GTATTTCCAA GGATGGTCAA ACTAGAGAAC ACGCTTTATT AGCATTCACC 250 

10 TTAGGTGTTA AGCAATTGAT TGTTGCTATC AACAAGATGG ATTCTGTTAA 3 00 

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCTTCTAACT GTCCATGGTA 4 50 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTCAAG GGTAAGACCT 500 

15 TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGATAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTAT TATTAAGCCA GGTATGGTTG 650 

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAAGA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

20 CGTCAAGAAC GTCTCTGTTA AGGATATCAA GAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA TGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 85 0 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCTGGTTACT CTCCAGTTTT 90 0 

AGATTGTCAC ACTGCCCACA TTGCATGTAA ATTCGATGAA TTAATTGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 10 00 

25 AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

AT 1102 



30 



40 



2 ) INFORMATION FOR SEQ ID NO: 421 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1099 bases 
35 <B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, kefyr 

(B) STRAIN: ATCC 28838 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421 

TGGACAAGTT AAAGGCTGAA AGAGAAAGAG GTATCACCAT CGATATCGCT 50 

TTGTGGAAGT TCGAAACTCC AAAGTACCAA GTTACCGTTA TCGATGCTCC 100 

AGGTCACAGA GATTTCATCA AGAACATGAT TACTGGTACT TCTCAAGCTG 150 

50 ACTGTGCTAT CTTGATTATT GCTGGTGGTG TCGGTGAATT CGAAGCCGGT 200 

ATCTCCAAGG ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACCTT 250 

GGGTGTTAGA CAATTGATTG TTGCTATCAA CAAGATGGAC TCTGTTAAGT 3 00 

GGGATGAATC TCGTTACCAA GAAATTGTTA AGGAAACCTC CAACTTCATC 350 

AAGAAGGTCG GTTACAACCC AAAGAATGTT CCATTCGTCC CAATCTCTGG 4 00 

55 TTGGAACGGT GACAACATGA TTGAAGCCAC CACCAACGCT CCATGGTACA 450 

AGGGTTGGGA AAAGGAAACC AAGGCTGGTA CCGTCAAGGG TAAGACCTTG 500 

TTGGAAGCTA TTGACGCTAT CGAACCACCA ACCAGACCAA CTGACAAGCC 550 

ATTGAGATTG CCATTGCAAG ATGTCTACAA GATCGGTGGT ATTGGTACTG 600 

TGCCAGTCGG TAGAGTCGAA ACCGGTGTCA TCAAGCCAGG TATGGTTGTT 650 

60 ACCTTCGCCC CAGCCGGTGT CACTACCGAA GTTAAGTCCG TCGAAATGCA 700 

254 
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10 



25 



WO 01/23604 PCT/CA00/01150 

CCACGAACAA TTGGAAGAAG GTCTACCAGG TGACAACGTC GGTTTCAACG * 750 

TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT CTGTGGTGAC 800 

TCCAAGAACG ATCCACCAAA GGCTGCTGCT TCTTTCAACG CCACTGTTAT 850 

CGTCTTGAAC CACCCAGGTC AAATCTCTGC TGGTTACTCT CCAGTTTTGG 900 

ATTGTCACAC TGCTCACATT GCTTGTAAGT TCGACGAATT GTTGGAAAAG 950 

AACGACAGAA GATCCGGTAA GAAGTTGGAA GACTCTCCAA AGTTCTTGAA 1000 

GTCTGGTGAC GCTGCTTTGG TTAAGTTCGT TCCATCTAAG CCAATGTGTG 1050 

TTGAAGCATT CTCTGACTAC CCACCATTGG GTAGATTCGC TGTCAGAGA 1099 



2) INFORMATION FOR SEQ ID NO: 422 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 



AAGTTAAAGG CAGAAAGAGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 

GAAGTTYGAA ACTCCAAART ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100 

3 0 ACAGAGATTT CATCAAGAAC ATGATTACCG GTACTTCTCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTGTCGGT GAATTCGAAG CTGGTATCTC 200 

CAAGGATGGT CAAACTAGAG AACACGCTCT ATTGGCTTTC ACCTTAGGTG 250 

TTAGACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAARTGGGAT 300 

GAAAACAGAT TTGAAGAAAT TGTCAAGGAA ACCCAAAACT TCATCAAGAA 350 

35 GGTTGGTTAC AACCCAAAGA CTGTTCCATT CGTTCCAATY TCTGGTTGGA 400 

ATGGTGACAA CATGATTGAA GCATCCACCA ACTGTCCATG GTACAAGGGT 450 

TGGACTAAGG AAACCAAGGC AGGTGTTGTT AAGGGTAAGA CCTTATTAGA 500 

AGCAATCGAT GCTATTGAAC CACCTGTCAG ACCAACCGAA AAGCCATTAA 550 

GATTACCATT ACAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

40 GTCGGTAGAG TCGAAACCGG TGTCATTAAG CCAGGTATGG TTGTCACTTT 650 

TGCTCCAGCA GGTGTCACCA CCGAAGTCAA GTCCGTTGAA ATGCACCATG 700 

AACAATTAGA ACAAGGTGTT CCAGGTGATA ACGTTGGTTT CAACGTTAAG 750 

AACGTTTCTG TCAAGGATAT CAAGAGAGGT AACGTTTGTG GTGACTCCAA 800 

GAACGACCCA CCAATGGGTG CAGCTTCCTT CAATGCTCAA GTCATTGTCT 850 

45 TGAACCACCC TGGTCAAATT TCCGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCCC ACATTGCATG TAAGTTCGAC GAATTAATCG AAAAGATTGA 950 

CAGAAGAACT GGTAAGTCTG TTGAAGACCA TCCAAAGTCY GTCAAGTCTG 1000 

GTGATGCAGC TATCGTCAAG ATGGTCCCAA CCAAGCCAAT GTGTGTTGAA 1050 

GCTTTCACTG AATAYCCACC ATTAGGTAGA TTCGCAGTCA GAGAT 1095 

50 

2) INFORMATION FOR SEQ ID NO: 423 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lambica 
5 (B) STRAIN: ATCC 24750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423 

CTTGGACAAG CTTAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

10 CTTTATGGAA GTTCGAAACT CCAAAGTACC ACGTTACCGT CATTGACGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCTCAAGC 150 

AGATTGTGCT ATTTTRATYA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200 

GTATCTCTAA GGATGGTCAA ACCAGAGAAC ACGCTCTTCT TGCATTCACT 250 

CTTGGTGTTA GACAATTGAT TGTTGCTATC AACAAGATGG ACTCTGTCAA 3 00 

15 GTGGGACGAA TCCAGATTCG ATGAAATTTG TAAGGAAACC GCWAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400 

GGTTGGAACG GTGACAACAT GATTGAACCA TCTGCTAACT GTCCATGGTA 450 

CAAGGGATGG ACTAAGGAAA CCAAGGCTTC CGGTGTCGTC AAGGGTAAGA 500 

CCCTTCTTGA AGCAATTGAT GCTATTGAGC CACCTGTCAG ACCAACTGAC 550 

2 0 AAGGCTTTGA GATTGCCATT RCAAGATGTC TACAAGATTG GTGGTATTGG 600 

TACTGTGCCA GT CGGT AG AG TTGAAACCGG TATCATCAAG CCAGGTATGA 650 

TTGTCGTTTT CGCTCCAACC GGTGTTACTA CTGAAGTTAA GTCCGTTGAA 700 

ATGCACCATG AACAATTAGA AGAAGGTGTC CCAGGTGACA ATGTTGGTTT 750 

CAACGTCAAG AACGTCTCTG TTAAGGATAT TAAGAGAGGT AACGTCTGTG 8 00 

25 GTGACTCCAA GAACGACCCA CCAATGGGTT GTGCTTCCTT CAATGCTCAA 850 

GTCATTGTTC TTAACCACCC AGGTCAAATT TCTGCTGGTT ACTCACCAGT 900 

TCTTGACTGT CACACTGCCC ACATTGCATG TAAGTTCGAY GAATTACTCG 950 

AAAAGATTGA CAGAAGAACC GGTAAGGCTA CTGAAGACCA TCCAAAGTCT 100 0 

GTCAAGTCTG GTGATGCAGC TATCGTCAAG ATGGTTCCAA CCAAGCCAAT 1050 

30 GTGTGTYGAA GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTYA 1100 

GAGA 1104 



35 2 ) INFORMATION FOR SEQ ID NO: 424 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, lusitaniae 

(B) STRAIN: ATCC 66035 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 



GGACAAGTTG AAGGCTGAGA GAGAAAGAGG TATCACCATC GATATCGCTT 50 

TGTGGAAGTT CGAGACTCCA AAGTACCACG TTACCGTCAT TGACGCTCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAAGCTGA 150 

CTGTGCTATC TTGATTATCG CTGGTGGTGT CGGTGAGTTC GAAGCCGGTA 200 

55 TCTCTAAGGA CGGTCAAACC AGAGAGCACG CTTTGTTGGC TTACACCTTG 250 

GGTGTCAAGC AGTTGATTGT TGCTGTCAAC AAGATGGACT CCGTCAAGTG 3 00 

GGACCAATCT AGATTCGAGG AAATCATCAA GGAAACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACTGTTC CATTCGTCCC AATCTCTGGT 400 

TGGAACGGTG ACAACATGAT TGAGCCATCY ACCAACTGCC CATGGTACAA 45 0 

60 GGGTTGGGAG AAGGAGACCA AGTCYGGTAA GTCCACCGGT AAGACCTTGT 500 

256 
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TGGAGGCCAT TGACGCCATT GAGCCACCTT 
TTGAGATTGC CATTGCAAGA TGTCTACAAG 
GCCAGTCGGT AGAGTTGAGA CCGGTGTCAT 
CCTTTGCTCC AGCTGGTGTC AC CACTGAAG 
5 CACGAACAAT TGGCTGAGGG TGTCCCAGGT 
CAAGAACGTT TCCGTCAAGG AAATCAGAAG 
CCAAGAACGA CCCACCAAAG GCTGCTGCTT 
GTCTTGAACC ACCCAGGTCA AATCTCCTCY 
CTGTCACACT GCYCACATTG CTTGTAAGTT 
10 TCGACAGAAG AACYGGTAAG AAGTTGGAAG 
TCYGGTGACG CTGCTATCGT CAAGATGGTC 
YGAAGCTTTC ACCGACTACC CACCATTGGG 



PCT/CA00/01150 

CGAGACCAAC CGACAAG(TCft v 55 9 

ATYGGTGGTA TTGGTACTGT 600 

CAAGGCCGGT ATGGTTGTCA 650 

TCAAGTCCGT GGAAATGCAC 700 

GACAACGTTG GTTTCAACGT 750 

AGGTAACGTC TGTGGTGACT 800 

CYTTCACTGC TCAAGTYATY 850 

GGTTACTCTC CAGTYTTGGA 900 

CGACACCTTG ATTGAGAAGA 950 

AAGAGCCAAA GTTCATCAAG 1000 

CCAACCAAGC CAATGTGYGT 1050 

TAGATTCGCT GTCAGAGA 1098 



15 

2) INFORMATION FOR SEQ ID NO: 425 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1101 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22 977 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425 

TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT TATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

35 TGATTGTGCT ATCTTAATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 2 00 

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTTTATT AGCATTCACC 250 

TTAGGTGTTA AGCAATTAAT TGTTGCTATC AACAAGATGG ACTCTGTTAA 3 00 

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 4 00 

40 GGTTGGAATG GTGACAACAT GATTGAACCA TCTACTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTTAAG GGTAAGACCT 500 

TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGACAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTGT TATTAAGCCA GGTATGGTTG 650 

45 TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAACA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

CGTCAAGAAC GTCTCTGTTA AGGATATTAA GAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCAGGTTACT CTCCAGTTTT 900 

50 AGATTGTCAC ACTGCCCACA TTGCATGTAA GTTCGATGAA TTAATCGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

A 1101 

55 



2) INFORMATION FOR SEQ ID NO: 426 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 
10 (B) STRAIN: ATCC 90018 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426 

CAAATTGAAG GCTGAAAGAG AAAGAGGTAT CACCATTGAT ATCGCTTTGT 50 

15 GGAAATTCGA AACTCCAAAA TACCATGTTA CTGTTATTGA TGCTCCAGGT 100 

CACAGAGATT- TCATCAAGAA TATGATTACT GGTACTTCTC AAGCTGATTG 150 

TGCTATTTTG ATTATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 2 00 

CTAAGGATGG TCAAACCAGA GAACACGCTT TGTTGGCTTA CACCTTGGGT 2 50 

GTTAAGCAAT TGATTGTTGC CATCAACAAG ATGGACTCAG TCAAATGGGA 300 

20 CAAGAACAGA TACGAAGAAA TTGTCAAGGA AACTTCCAAC TTCGTCAAGA 350 

AGGTTGGTTA CAACCCTAAA GCTGTCCCAT TCGTCCCAAT CTCTGGTTGG 400 

AACGGTGACA ATATGATTGA ACCATCAACC AACTGTCCAT GGTACAAGGG 450 

TTGGGAAAAG GAAACTAAAG CTGGTAAGGT TACCGGTAAG ACCTTGTTGG 500 

AAGCTATCGA TGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG 550 

25 AGATTGCCAT TGCAAGATGT CTACAAGATT GGTGGTATTG GAACTGTGCC 60 0 

AGTTGGTAGA GTTGAAACCG GTATCATCAA GGCTGGTATG GTTGTTACTT 650 

TTGCCCCAGC TGGTGTTACC ACTGAAGTCA AGTC CGTTGA AATGCACCAC 700 

GAACAATTGA CTGAAGGTGT CCCAGGTGAC AATGTTGGTT TCAACGTCAA 75 0 

GAACGTTTCA GTTAAGGAAA TCAGAAGAGG TAACGTTTGT GGTGACTCCA 800 

30 AGAACGATCC ACCAAAGGGA TGTGAYTCCT TCAATGCTCA AGTTATTGTC 850 

TTGAACCACC CAGGTCAAAT CTCTGCTGGT TACTCACCAG TCTTGGATTG 900 

TCACACTGCC CACATTGCTT GTAAATTCGA CACTTTGATT GAAAAGATTG 950 

ACAGAAGAAC CGGTAAGAAA TTGGAAGATG AACCAAAATT CATCAAGTCC 1000 

GGTGATGCTG CYATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA 1050 

35 AGCTTTCACT GACTACCCAC CATTGGGAAG ATTCGCTGTT AGAGA 1095 



40 



2) INFORMATION FOR SEQ ID NO: 427 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427 

55 CTCCGTCAAG TGGTCTCAGT CTCGTTTCGA GGAGATCGTC AAGGAGGTTT 50 

CCAACTTCAT CAAGAAGGTT GGTTACAAGC CCGATGAGGT TCCTTTCGTC 100 

CCCATCTCTG GCTGGAACGG CGACAACATG CTTGAGCCCT CCACCAACTG 150 

CCCCTGGTAC AAGGGATGGA CCAAGAAGAC CAAGAAGGGT GAGGTCAAGG 200 

GTAAGACTCT TCTCGAGGCC ATTGACGCCA TCGAGCCCCC CTCCCGTCCT 250 

60 ACCGACAAGC CCCTCCGCTT GCCTCTTCAG GATGTCTACA AGATCGGCGG 300 

258 
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TATCGGTACG GTACCTGTCG 
GCATGGTCGT CACTTTCGCC 
GTCGAGATGC ACCACGAGCA 
CGGTTTCAAC GTCAAGAACG 
TCTGCGGTGA CTCCAAGAAC 
GCCCAGGTCA TCGTTTTCAA 
CCCCGTTTTG GACTGCCACA 
TCCAGTCCAA GATGGACCGT 
AAGTTCATCA AGGCTGGTGA 
GC 



GCCGTGTCGA 
CCCGCTGGTG 
GATCCCCGAG 
TTACCGTCAA 
GACCCCCCCA 
CCACCCCGGT 
CCGCGCACAT 
CGTACTGGTA 
CTCCGCTATC 



GACCGGTATC ATCAAGCCCG 
TCACCACTGA AGTGAAGTCC 
GGTCTCCCCG GTGACAACGT 
GGATATCCGC CGTGGTAACG 
AGGGCTGCTC TTCCTTCACT 
CAGATCTCCA ACGGTTACTC 
TGCCTGCCGC TTCGACGAGA 
AGACCCTTGA GGAGAACCCC 
GTCAAGATGG TTCCCTCCAA 



350 
400 
450 
500 
550 
600 
650 
700 
750 
752 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 42 8 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1093 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428 

AGTTAAAGGC TGAAAGAGAA AGAGGTATCA CCATCGATAT CGCTTTGTGG 50 

AAGTTCGAAA CTCCAAAGTA CCAAGTTACC GTTATCGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGACTGTG 150 

CTATCTTGAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCC 200 

AAGGATGGTC AAAC C AG AG A ACACGCTTTG TTGGCTTTCA CCTTGGGTGT 250 

TAGACAATTG ATTGTTGCTG TTAACAAGAT GGATTCCGTT AAGTGGGATG 3 00 

AATCTCGTTT CCAAGAAATT GTCAAGGAAA CCTCTAACTT CATCAAGAAG 350 

GTCGGTTACA ACCCAAAGAC TGTTCCATTC GTCCCAATCT CTGGTTGGAA 4 00 

CGGTGACAAC ATGATTGAAG CCACCACCAA TGCTTCATGG TACAAGGGTT 450 

GGGAAAAGGA AACCAAGTCC GGTGTCGTCA AGGGTAAGAC CTTGTTGGAA 500 

GCTATTGACG CTATCGAACC ACCATCCAGA CCAACTGACA AGCCATTGAG 550 

ATTGCCATTG CAAGATGTCT ACAAGATTGG TGGTATCGGA ACTGTGCCAG 600 

TCGGTAGAGT CGAAACCGGT GTTATCAAGC CAGGTATGAT TGTTACCTTT 650 

GCCCCAGCCG GTGTTACTAC TGAAGTTAAG TCCGTCGAAA TGCACCACGA 70 0 

ACAATTGGAA GAAGGTCTAC CAGGTGACAA CGTCGGTTTC AACGTCAAGA 750 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTCTGTGG TGACTCCAAG 800 

AACGATCCAC CAAAGGCTGC TGCTTCTTTC AACGCCACTG TTATCGTCTT 850 

GAACCATCCA GGTCAAATCT CTGCTGGTTA CTCTCCAGTT TTGGATTGTC 900 

ACACTGCTCA CATTGCTTGT AAGTTCGACG AATTGTTGGA AAAGAACGAT 950 

AGAAGATCCG GTAAGAAGTT GGAAGACTCT CCAAAGTTCT TGAAGTCCGG 1000 

TGATGCTGCT TTGGTTAAGT TCGTTCCATC TAAGCCAATG TGTGTTGAAG 1050 

CCTTCTCTGA CTACCCACCT CTAGGTAGAT TCGCTGTCAG AGA 1093 



2) INFORMATION FOR SEQ ID NO: 42 9 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 9 

AAATTGAAGG CTGAAAGAGA AAGAGGTATC ACCATTGATA TCGCTTTGTG 50 

GAAATTCGAA ACTCCAAAAT ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100 

ACAGAGATTT CATCAAGAAC ATGATTACTG GTACTTCCCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTACTGGT GAATTCGAAG CTGGTATTTC 200 

TAAAGATGGT CAAACCAGAG AACACGCTTT GTTGGCTTAC ACCTTGGGTG 250 

TCAAACAATT GATTGTTGCT GTCAACAAGA TGGACTCTGT TAAATGGGAC 3 00 

AAAAACAGAT TTGAAGAAAT TATCAAGGAA ACTTCTAACT TCGTCAAGAA 3 50 

GGTTGGTTAC AACCCTAAGG CTGTTCCATT CGTTCCAATC TCTGGTTGGA 4 00 

ATGGTGACAA CATGATTGAA GCTTCTACCA ACTGTCCATG GTACAAGGGT 450 

TGGGAAAAAG AAACCAAGGC TGGTAAGGTT ACCGGTAAGA CTTTGTTGGA 500 

AGCCATTGAT GCTATTGAAC CACCTTCAAG ACCAACTGAC AAGCCATTGA 550 

GATTGCCATT GCAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

GTCGGTAGAG TTGAAACTGG TGTCATCAAA GCCGGTATGG TTGTTACTTT 650 

CGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCCGTCGAA ATGCACCACG 700 

AACAATTGGC TGAAGGTGTC CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750 

AACGTTTCTG TTAAAGAAAT TAGAAGAGGT AACGTTTGTG GTGACTCCAA 800 

GAACGATCCA CCAAAGGGTT GTGACTCTTT CAACGCTCAA GTTATTGTCT 850 

TGAACCACCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCTC ATATTGCTTG TAAATTCGAC ACCTTGGTTG AAAAGATTGA 950 

CAGAAGAACT GGTAAGAAAT TGGAAGAAAA TCCAAAATTC GTCAAATCCG 1000 

GTGATGCTGC TATTGTCAAG ATGGTTCCAA CCAAACCAAT GTGTGTTGAA 1050 

GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTCA GAGA 1094 



2) INFORMATION FOR SEQ ID NO : 43 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 

CAAGCTTAAA GCTGAGAGAG AGAGAGGTAT CACTATCGAC ATTGCTCTCT 50 

GGAAGTTCGA GACTCCAAAG TACCACGTTA CTGTCATTGA TGCCCCAGGT 10 0 

CACAGAGATT TCATCAAGAA CATGATTACT GGTACCTCCC AGGCTGACTG 15 0 

TGCTATTCTT ATCATTGCCG GTGGTGTTGG TGAGTTCGAG GCTGGTATCT 200 

CTAAGGATGG TCAGACCAGA GAGCACGCTT TGCTCGCTTT CACCCTTGGT 250 

GTTAGACAGA TGATTGTTGC TATCAACAAG ATGGACTCTG TCAAGTGGGA 300 

CGAGAAGAGA TTCGAGGAGA TCGTTAAGGA GACCTCTAAC TTCATCAAGA 350 

AGGTTGGTTA CAACCCAAAG ACTGTTCCAT TTGTCCCAAT TTCYGGTTGG 40 0 

AACGGTGACA ACATGATTGA GGCCTCTACC AACTGTCCAT GGTACAAGGG 450 

260 
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TTGGGAGAAG GAGACCAAGG CTGGTGTTGT CAAGGGTAAG ACCTTGCTCG 1 &6 

ATGCCATTGA CGCCATTGAG CCACCAACAA GACGAACTGA CAAGCCATTG 550 

AGATTGCCAC TCCAGGATGT CTACAAGATT GGTGGTATCG GAACTGTTCC 600 

AGTCGGTAGA GTCGAGACCG GTGTCATCAA GCCAGGTATG GTTGTTACCT 650 

5 TTGCCCCATC CGGTGTCACC ACTGAGGTTA AGTCGGTCGA GATGCACCAC 700 

GAGCAGCTTG CTGAGGGTAT CCCAGGTGAC AACGTTGGTT TCAACGTTAA 750 

GAACGTCTCT GTTAAGGAGA TCAGAAGAGG TAACGTTGCC GGTGACTCCA 8 00 

AGAACGACCC ACCACAGGGT GCTGAGTCCT TCAACGCTCA GGTCATTGTC 850 

TTGAACCACC CAGGTCAGAT CTCTGCTGGT TACTCTCCAG TTTTGGACTG 900 

10 TCACACCGCC CACATTGCTT GTAAGTTCTC TGAGCTTTTG GAGAAGATTG 950 

ACAGAAGATC CGGTAAGTCC CTTGAGGCCT CTCCAAAGTT CGTCAAGTCT 1000 

GGTGATGCCG CTATCGTCAA GATGGTTCCA TCCAAGCCAT TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CACTCGGTAG ATTCGCTGTC AGAGA 1095 

15 



2) INFORMATION FOR SEQ ID NO : 431 

( i ) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 2 8269 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431 



GCTGAAAGAG AAAGAGGTAT CACCATCGAT ATCGCTTTGT GGAAATTCGA 50 

AACTCCAAAR TACCACGTTA CCGTCATTGA YGCTCCAGGT CACAGAGATT 10 0 

35 TCATCAAGAA CATGATYACT GGTACTTCTC AAGC TGATTG TGCTATYTTG 150 

ATTATCGCTG GTGGTACTGG TGAATTCGAA GCTGGTATYT CTAAGGATGG 2 00 

TCAAACCAGA GAACACGCTT TGTTGGCCTA CACCTTGGGT GTCAAGCAAT 250 

TGATTGTTGC TGTCAACAAG ATGGACTCTG TCAAATGGGA CAAGAACAGA 3 00 

TTCGAAGAAA TCATCAAGGA AACCTCCAAC TTCGTCAAGA AGGTTGGTTA 350 

40 CAACCCAAAG ACTGTTCCAT TCGTCCCAAT CTCTGGTTGG AACGGTGACA 400 

ACATGATTGA AGCCTCCACC AACTGCCCAT GGTACAAGGG TTGGGAAAAG 450 

GAAACCAAGG CTGGTAAGGT TACCGGTAAG ACTTTGTTGG AAGCCATTGA 500 

CGCTATCGAA CCACCAACCA GACGAACTGA CAAGCCATTG AGATTGCCAT 550 

TGCAAGATGT CTACAAGATT GGTGGTATCG GAACTGTGCC AGTCGGTAGA 600 

45 GTTGAAACTG GTGTCATCAA GGCCGGTATG GTTGTCACTT TYGCCCCAGC 650 

TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC GAACAATTGG 70 0 

CTGAAGGTGT CCCAGGTGAC AACGTTGGTT TCAACGTCAA GAACGTTTCC 750 

GTCAAGGAAA TCAGAAGAGG TAACGTCTGT GGTGACTCCA AGAACGACCC 800 

ACCAAAGGGT TGTGASTCTT TCAACGCTCA AGTCATTGTC TTGAACCACC 850 

50 CAGGTCAAAT CTCTGCTGGT TACTCTCCAG TCTTGGATTG TCACACTGCC 900 

CACATTGCTT GTAAGTTTGA CACCTTGGTT GAAAAGATTG ACAGAAGAAC 950 

CGGTAAGAAG TTGGAAGAAA ACCCAAAGTT TGTCAAGTCC GGTGACGCTG 100 0 

CTATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA AGCYTTCACT 1050 

GACTACCCAC CATTGGGTAG ATTCGCTGTC AGAGA 1085 

55 



2) INFORMATION FOR SEQ ID NO: 432 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1072 bases 

(B) TYPE; Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 
10 (B) STRAIN: ATCC 7351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432 

AGGTATTACC ATTGACATTG CCTTGTGGAA GTTCGAGACC CCCAAGTACC 50 

15 AGGTCACCGT CATTGACGCT CCTGGCCACA GAGATTTCAT TAAGAACATG 100 

ATCACTGGTA CCTCCCAGGC TGACTGTGCC ATCTTGATCA TTGCTGGTGG 150 

TGTTGGTGAG TTCGAGGCTG GTATCTCCAA GGATGGCCAG ACCAGAGAGC 200 

ACGCCTTGCT TGCCTACACC TTGGGTGTCA AGCAATTGAT TGTTGCTGTC 250 

AACAAGATGG ACTCCGTCAA GTGGGACAAG AACAGATTCG AGGAGATTGT 300 

20 CAAGGAGACC TCCAACTT CG TCAAGAAGGT TGGCTACAAC CCCAAGACTG 350 

TCCCCTTCGT TCCCATCTCC GGTTGGAACG GTGACAACAT GATTGAGGCC 400 

TCCACCAACT GCCCTTGGTA CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG 45 0 

TAAGGTCACT GGTAAGACCT TGTTGGAGGC TATTGACGCC ATTGAGCCCC 500 

CCACCAGACC CACCGACAAG CCCTTGAGAT TGCCCTTGCA GGATGTCTAC 550 

25 AAGATTGGTG GTATTGGAAC GGTGCCCGTT GGCAGAGTTG AGACCGGCAT 600 

CATCAAGGCC GGTATGGTTG TCACCTTTGC CCCCGCTGGT GTCACTACTG 650 

AAGTGAAGTC TGTCGAGATG CACCACGAGC AATTGGCTGA GGGTGTCCCA 700 

GGTGACAATG TTGGTTTCAA CGTGAAGAAC GTTTCCGTTA AGGAGATCAG 750 

AAGAGGTAAC GTTTGCGGTG ACTCCAAGAA CGACCCCCCC AAGGCTGCTG 8 00 

3 0 CTTCTTTCAA CGCCCAGGTT ATCGTCTTAA ACCACCCCGG TCAAATCTCT 850 

GCTGGTTACT CTCCGGTTTT- GGATTGCCAC ACTGCCCACA TTGCTTGCAG 900 

ATTCGACCAG TTGATTGAGA AGATCGACAG AAGAACCGGT AAGAAGATGG 950 

AGGACGACCC TAAGTTCATC AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 1000 

GTTCCTTCCA AGCCCATGTG TGTTGAGGCC TTCACTGACT ACCCTCCCTT 1050 

35 GGGTCGTTTC GCTGTCAGAG AC 1072 



2) INFORMATION FOR SEQ ID NO: 433 



40 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433 

55 AGCACCAACT GGTCCGAGCC TCGTTTCAAC GAAATCGTCA AGGAAGTCTC 50 

CAACTTCATC AAGAAGGTCG GATACAACCC CAAGGCTGTT CCATTCGTCC 100 

CCATCTCTGG TTTCGAAGGT GACAACATGA TTCAACCCTC CACCAACGCT 150 

CCTTGGTACA AGGGCTGGAA CAAGGAGACC GCCTCTGGCA AGCACACTGG 2 00 

CAAGACCCTC CTCGACGCCA TTGATGCCAT CGACCCCCCA ACCCGCCCCA 250 

60 CCGAGAAGCC CCTCCGTCTC CCACTTCAGG ATGTGTACAA GATCTCTGGT 300 

262 
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ATCGGAACAG TCCCAGTCGG 
TATGGTTGTG ACCTTCGCTC 
TCGAAATGCA CCACCAGCAG 
GGCTTCAACG TCAAGAACGT 
5 CGCTGGTGAC TCCAAGAACG 
CCCAGGTCAT CGTCCTCAAC 
CCAGTCCTTG ACTGCCACAC 
CCTCGAGAAG ATCGACCGCC 
AGTTCATCAA GT CTGGTG AT 
10 C 



CCGTGTCGAA ACCGGTGTTA 
CTTCCAACGT CACCACTGAA 
CTCACCCAGG GTAACCCTGG 
CTCTGTCAAG GAAGTCCGCC 
ACCCACCAAA GGGCTGCGAC 
CACCCTGGTC AAGTCGGTGC 
TGCCCACATT GCTTGCAAGT 
GTACCGGTAA ATCCGTTGAG 
GCCGCTATCG TCAAGATGGT 



TCAAGCCTGG * 350 

GTCAAGTCCG 40 0 

TGACAACGTT 450 

GCGGTAACGT 50 0 

TCCTTCAACG 550 

TGGTTATGCC 600 

TCTCCGAGCT 650 

AACAACCCCA 7 00 

TCCATCCAAG 750 

751 



2) INFORMATION FOR SEQ ID NO: 434 



15 



( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 1146 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Czryptococcus albidus 

(B) STRAIN: ATCC 6603 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 

3 0 AAGCTCAAGG CCGAGCGAGA GCGAGGTATC ACCATCGACA TCGCCTTGTG 50 

GAAGTTCGAG ACCCCCAAGT ACAATGTCAC CGTCATTGAC GCCCCCGGTC 100 

ACCGAGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA GGCCGACTGT 150 

GCCATCCTCA TCATCGCCTC CGGTATCGGA GAGTTCGAGG CTGGTATCTC 2 00 

CAAGGACGGT CAGACCCGAG AGCACGCCCT TTTGGCCTTC ACCCTCGGTG 2 50 

35 TCCGACAGCT CATCATTGCC ATCAACAAGA TGGACACCTG CAAGGTTAGT 3 00 

TCGCAGGTCC TGGTCTCTGT ACGAATCTTG CTGACCCCTT TTACAGTGGT 350 

CCGAAGACCG ATACAACGAA ATCGTCAAGG AGGCTTCCGG TTTCATCAAG 4 00 

AAGGTCGGAT ACAACCCCAA GACCGTTCCC TTCGTCCCCA TCTCCGGATG 450 

GCACGGAGAC AACATGTTGG AGGAGTCCAC CAACATGCCC TGGTACAAGG 500 

40 GATGGCACAA GGAGTCCAAG GCCGGTGTTG TCAAGGGAAA GACCTTGCTC 550 

GAGGCCATCG ACGCCATCGA GCCCCCTACC CGACCTTCCG ACAAGCCCTT 600 

GCGATTGCCC CTCCAGGATG TCTACAAGAT CGGTGGTATC GGTACGGTGC 650 

CCGTCGGTCG AGTCGAGACC GGTGTCATCA AGGCTGGTAT GGTCGTCACC 700 

TTCGCCCCTG CCAACGTCAC CAC CGAAGTC AAGTCCGTCG AAATGCACCA 750 

45 CGAACAGCTC GCTGAGGGTG TTCCCGGTGA CAACGTCGGT TTCAACGTCA 800 

AGAACGTTTC CGTCAAGGAC ATCCGACGAG GAAACGTCTG CTCCGACTCG 8 50 

AAGAACGACC CCGCTATGGA GTCTGCTTCC TTCAACGCTC AGGTCATTGT 900 

CTTGAACCAC CCGGGTCAGA TCGGTGCCGG CTACTCCCCC GTTTT<3GACT 950 

GCCACACCGC TCACATTGCC TGCAAGTTCG CTGAGCTCGT TGAGAAGATC 1000 

50 GACCGACGAA CCGGTAAGGT CATGGAGGCC GCCCCCAAGT TCGTCAAGTC 1050 

CGGTGACGCC GCCATCGTCA AGTTGATCCC GTCCAAGCCC ATGTGTGTCG 1100 

AGTCCTACTC CGAGTACCCC CCCTTGGGTC GATTCGCCGT CCGAGA 1146 



55 

2 ) INFORMATION FOR SEQ ID NO: 435 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1095 bases 
60. (B) TYPE: Nucleic acid 

263 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala. jeanselmex 

(B) STRAIN: ATCC 64755 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435 

CAAGCTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCTTGT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CTGTCATCGA CGCCCCTGGT 10 0 

CATCGTGACT TTATCAAGAA CATGATCACT GGTACTTCCC AGGCTGACTG 150 

15 CGCCATTCTC ATCATTGCCG CCGGTACTGG TGAATTCGAA GCCGGTATCT 2 00 

CCAAGGATGG TCAGACTCGT GAGCACGCTC TGCTCGCCTA CACCCTGGGT 250 

GTCAAGCAGC TCATTGTCGC CATCAACAAG ATGGACACCA CCAAGTGGTC 300 

CGAGGATCGT TTCAACGAAA TCATCAAGGA GACTTCCAGC TT CATC AAGA 350 

AGGTCGGCTA CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC 400 

20 AACGGTGACA ACATGATCGA TGTCTCCACC AACTGCCCCT GGTACAAGGG 450 

CTGGGAGAAG GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTCG 500 

AGGCCATCGA CGCCATCGAC CCCCCCACTC GTCCCACCGA CAAGCCTCTC 550 

CGTCTTCCTC TCCAGGATGT CTACAAGATC TCTGGTATCG GAACGGTGCC 60 0 

CGTCGGTCGT GTTGAGACTG GTGTCATCAA GGCCGGTATG GTCGTTACCT 6 50 

25 TCGCTCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA AATGCAC CAC 700 

GAACAACTCG CCGAGGGTGT TCCAGGTGAC AACGTTGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGAGG TTCGTCGTGG AAACGTCTGC GGTGACTCCA 800 

AGAACGACCC ACCCAAGGGT GCTGATTCCT TCAACGCCCA GGTCATCGTC 850 

TTGAACCACC CTGGTCAAGT CGGTGCTGGC TACGCCCCAG TGTTGGATTG 900 

3 0 CCACACTGCC CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATTG 950 

ACCGCCGTAC CGGTAAATCC ATCGAAAACA ACCCCAAGTT CATCAAGTCT 1000 

GGTGACGCTG CCATCGTCAA GATGGTTCCC AGCAAGCCCA TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CTCTTGGTCG TTTCGCCGTC CGTGA 1095 



35 



2 ) INFORMATION FOR SEQ ID NO: 436 

( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 



AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCGAG ACTCCTCGCT ACTATGTCAC CGTCATTGGT ATGTTGTCGC 100 

55 TCATGCTTCA TTCTACTTCT CTTCGTACTA ACACATCACT CAGACGCTCC 150 

CGGTCACCGT GATTTCATCA AGAACATGAT CACTGGTACT TCCCAGGCCG 200 

ATTGCGCCAT TCTCATCATT GCCGCCGGTA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTTCTTG CCTACACCCT 3 00 

TGGTGTCAAG AACCTCATCG TCGCCATCAA CAAGATGGAC ACCACCAAGT 350 

60 GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGAC CTC CTCTTTCATC 4 00 

264 
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AAGAAGGTCG GCTACAACCC CAAGGCTGTC 
TTTCAACGGT GACAACATGC TTACCCCCTC 
AGGGTTGGGA GCGTGAGATC AAGTCCGGCA 
CTCGAGGCCA TTGACTCCAT CGAGCCCCCC 
5 CCTTCGTCTT CCCCTTCAGG ATGTCTACAA 
TTCCCGTCGG CCGTATCGAG ACTGGTGTCA 
ACCTTCGCTC CTTCCAACGT CACCACTGAA 
CCACGAGCAA CTCACTGAGG GCCAGCCCGG 
TGAAGAACGT CTCCGTCAAG GACATCCGAC 

10 TCCAAGAACG ACCCCCCTAT GGGTGCCGCT 
CGTCCTCAAC CACCCCGGTC AGGTCGGTGC 
ATTGTCACAC TGCCCACATT GCCTGCAAGT 
ATCGACCGCC GAACCGGTAA GGCTACTGAG 
GTCTGGTGAC TCCGCCATCG TCAAGATGGT 

15 TTGAGGCTTT CAC 



PCT/CA00/01150 

GCTTTCGTCC CCATCTCCGG 450 

CACCAACTGC CCCTGGTACA 500 

AGCTCACTGG CAAGACCCTC 550 

AAGCGTCCCG TTGACAAGCC 600 

GATCGGTGGT ATTGGAACGG 650 

TCAAGCCCGG TATGGTCGTT 700 

GTCAAGTCCG TCGAGATGCA 750 

TGACAACGTT GGTTTCAACG 800 

GTGGTAACGT CGCTGGTGAC 850 

TCTTTCACCG CCCAGGTCAT 900 

TGGTTACGCT CCCGTCCTCG 950 

TCGCCGAGAT CCAGGAGAAG 1000 

GCCGCCCCCA AGTTCATCAA 1050 

TCCCTCCAAG CCCATGTGTG 1100 

1113 



20 



25 



2) INFORMATION FOR SEQ ID NO : 437 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Geotrichum spp. 

(B) STRAIN: LEV- 4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437 



35 GGTCCGAGGA 
AAGAAGGTTG 
TTGGAACGGT 
AGGGATGGCA 
CTTGAGGCCA 

40 CCTCCGTCTT 
TGCCCGTCGG 
ACCTTCGCCC 
CCACGAGCTC 
TCAAGAACGT 

45 TCCAAGAACG 
TATCTTCAAC 
ATTGCCACAC 
ATTGACCGTC 
GGCCGGTGAT 

50 



CAGATTCAAC 
GTTAYAACCC 
GACAACATGA 
GAAGGAGACC 
TCGATGCCAT 
CCCCTCCAGG 
CCGTGTCGAA 
CCGCTGGTGT 
CTCACTGAGG 
CTCCGTTAAG 
ATCCCCCCAA 
CACCCTGGTC 
CGCCCATATT 
GTACTGGTAA 
GCTGCTATCG 



GAGATTGTCA 
CAAGACTGTT 
TTGAGCCCTC 
AAGGCTGGTG 
TGAGCCCCCT 
ATGTCTACAA 
ACCGGTGTCA 
CACCACTGAA 
GTCTCCCCGG 
GATATCAGAC 
GGCTTGCGCT 
AGATCTCTGC 
GCTTGCAAGT 
GAAGACTGAG 
TCAAGA 



AGGAGACTTC 
GCTTTCGTCC 
CACCAACTGC 
TCACTAAGGG 
GTCAGACCTT 
GATCGGTGGT 
TCAAGGCCGG 
GTCAAGTCCG 
TGACAACGTT 
GTGGTAACGT 
TCTTTCAACG 
TGGATACTCT 
TCGACACTTT 
GACTCCCCCA 



CAACTTCATC 50 

CCATCTCTGG 100 

CCCTGGTACA 150 

TAAGACCCTC 200 

CCGACAAGCC 250 

ATCGGAACTG 3 00 

TATGGTCGTC 3 50 

TCGAGATGCA 400 

GGTTTCAACG 450 

CTGCGGTGAC 500 

CCCAGGTCAT 550 

CCCGTCCTTG 600 

GATCGAGAAG 650 

AGTTCGTCAA 700 

726 



2) INFORMATION FOR SEQ ID NO : 43 8 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

265 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 
5 (B) STRAIN: G186A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438 

CACCACCAAG TGGTCCGAGT CCCGTTTCAA CGAAATCATC AAGGAGGTTT 50 

10 CCAACTTCAT CAAGAAGGTC GGATATAACC CCAAGGCTGT TCCCTTCGTG 100 

CCAATCTCTG GTTTCGAGGG TGACAACATG ATTGAACCCT CCCCCAACTG 150 

CACATGGTAC AAGGGCTGGA ACAAGGAGAC TGCCTCTGGC AAGTCTTCTG 2 00 

GTAAAACCCT TCTCGATGCC ATTGACGCCA TTGAACCCCC AACCCGTCCT 250 

ACCGATAAGC CCCTCCGTCT TCCCCTCCAG GATGTGTACA AAATCTCTGG 3 00 

15 TATTGGCACT GTTCCCGTCG GACGTGTTGA GACTGGTGTC ATCAAGCCCG 3 50 

GTATGGTCGT GACTTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC 400 

GTCGAAATGC ACCACCAACA ACTCCAGGCT GGTTACCCTG GCGACAACGT 450 

CGGCTTCAAC GTCAAGAACG TTTCAGTCAA GGAAGTCCGC CGTGGCAACG 50 0 

TTGCTGGCGA CTCCAAAAAT GATCCCCCTA AGGGCTGCGA ATCCTTCAAT 550 

2 0 GCCCAGGTCA TCGTCCTTAA CCACCCCGGC CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC 650 

TTATTGAGAA GATCGACCGC CGTACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCTGGTGA TGCTGCTATC GTCAAGATGG TTCCCTCCAA 750 

GCCC 754 

25 

2) INFORMATION FOR SEQ ID NO : 43 9 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Issatchenkia orientalis 
40 (B) STRAIN: ATCC 6258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 

TGGGATGAAA ACAGATTTGA AGAAATTGTC AAGGAAACCC AAAACTTCAT 50 

45 CAAGAAGGTT GGTTACAACC CAAAGACTGT TCCATTCGTT CCAATCTCTG 100 

GTTGGAATGG TGACAACATG ATTGAAGCAT CCACCAACTG TCCATGGTAC 150 

AAGGGTTGGA CTAAGGAAAC CAAGGCAGGT GTTGTTAAGG GTAAGACCTT 200 

ATTAGAAGCA ATCGATGCTA TTGAACCACC TGTCAGACCA ACCGAAAAGC 250 

CATTAAGATT ACCATTACAA GATGTTTACA AGATTGGTGG TATTGGTACT 300 

50 GTGCCAGTCG GTAGAGTCGA AACCGGTGTC ATTAAGCCAG GTATGGTTGT 3 50 

CACTTTTGCT CCAGCAGGTG TCACCACCGA AGTCAARTCC GTTGAAATGC 400 

ACCATGAACA ATTAGAACAA GGTGTTCCAG GTGATAACGT TGGTTTCAAC 4 50 

GTTAAGAACG TCTCTGTCAA GGATATCAAG AGAGGTAACG TTTGTGGTGA 500 

CTCCAAGAAC GACCCACCAA TGGGTGCAGC TTCYTTCAAT GCTCAAGTCA 550 

55 TTGTCTTGAA CCACCCTGGT CAAATTTCCG CTGGTTACTC TCCAGTCTTG 60 0 

GATTGTCACA CTGCCCACAT TGCATGTAAG TTCGACGAAT TAATCGAAAA 650 

GATTGACAGA AGAACTGGTA AGTCTGTTGA AGACCATCCA AAGTCYGTCA 700 

AGTCTGGTGA TGCAGCTATC GTCAAGATGG TCCCAACCAA GCC 743 



60 
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2) INFORMATION FOR SEQ ID NO: 44 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 



CAAGCTCAAG GCTGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTGT 50 

GGAAGTTCGA GACCCCTAAG TACCACGTTA CCGTCATTGA CGCTCCTGGT 100 

2 0 CAC CGTGACT TCATCAAGAA CATGATTACG GGTACCTCGC AGGCTGACTG 150 

CGCTATCCTC ATCATTGCCG GTGGTACCGG TGAGTTCGAG GCTGGTATCT 2 00 

CGAAGGACGG TCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTGGGT 250 

GTGCGTCAGC TCATTGTGGC CGTCAACAAG ATGGACACCA CCAAGTACTC 300 

GGAGGACCGC TTCAACGAGA TTGTCCGCGA AGTGTCGAAC TTCATCAAGA 3 50 

25 AGGTCGGTTT CAACCCCAAG ACTGTTGCCT TCGTCCCCAT CTCGGGCTGG 4 00 

CACGGTGACA ACATGATCGA GGCCACCACC AACATGCCTT GGTACAAGGG 450 

CTGGGAGAAG GAGACCAAGT CGGGCAAGGT CACTGGTAAG ACTCTGCTGG 500 

ACGCCATCGA CGCCATCGAG CCCCCGACCC GCCCCACTGA CAAGCCCCTG 550 

CGTCTCCCTC TGCAGGATGT GTACAAGATC GGTGGTATCG GTACTGTCCC 600 

3 0 TGTCGGTCGT GTTGAGACCG GTGTGATCAA GCCCGGTATG GTTGTGACCT 650 

TCGCTCCCTC GAACGTCACC ACTGAAGTTA AGTCGGTTGA GATGCACCAC 700 

GAGTCGCTCC CTGAGGGTCT CCCCGGTGAC AACGTTGGTT TCAACGTGAA 750 

GAACGTCTCG GTTAAGGACA TTCGCCGTGG TAACGTTGCC TCGGACTCGA 800 

AGAACGACCC CGCTCAGGAG GCTGCTTCGT TCAACGCGCA GGTCATTGTC 850 

3 5 ATGAACCACC CTGGTCAGAT CAGCAACGGT TACTCGCCCG TGCTTGACTG 900 

CCACACTGCG CACATTGCCT GCCGCTTCAA CAACATCCTC CAGAAGATCG 95 0 

ACCGTCGCTC GGGTAAGGTG CTTGAGGAGA ACCCCAAGTT CATCAAGTCG 10 00 

GGTGACGCTG CCATGGTGGA GATGATCCCC ACCAAGCCCA TGTGTGTGGA 1050 

GTCGTTCAAC GAGTACCCCC CTCTGGGTCG TTTCGCTGTG C 1091 

40 



2) INFORMATION FOR SEQ ID NO: 441 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 749 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 
55 (B) STRAIN: ATCC 42756 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441 

ACCACCAAGT ACTCGGAGGA CCGCTTCAAC GAGATTATTC GCGAGACTTC 50 
60 GAACTTCATC AAGAAGGTCG GTTACAACCC GAAGACTGTT GCCTTCGTCC 100 

267 
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CGATCTCGGG CTGGCACGGT GACAACATGA TTGAGGCGAC 'CACCT^CATG 150 

CCGTGGTACA AGGGCTGGGA GAAGGAGACC AAGTCGGGCA AGGCCACTGG 200 

TAAGACCCTT CTGGACGCTA TTGACGCCAT TGAGCCGCCG ACGCGCCCGA 250 

CCGACAAGCC TCTCCGTCTT CCTCTCCAGG ATGTGTACAA GATCGGTGGT 3 00 

5 ATCGGTACYG TCCCGGTCGG CCGTGTTGAG ACCGGTGTTA TCAAGCCCGG 350 

TATGGTTGTG ACCTTCGCTC CGTCGAACGT CAC S ACTGAA GTTAAGTCGG 400 

TCGAGATGCA CCACGAGCAG ATCCCTGAGG GTCTTCCGGG TGACAACGTT 450 

GGTTTCAACG TGAAGAACGT GTCGGTCAAG GACATTCGCC GTGGTAACGT 500 

CGCCTCGGAC TCGAAGAACG ACCCGGCTCA GGAGGCTGCC TCGTTCAATG 550 

10 CTCAGGTCAT TGTGATGAAC CACCCTGGTC AGATCAGCAA CGGTTACTCG 600 

CCRGTGCTCG ACTGCCACAC TGCTCACATT GCCTGCCGCT TCAACAACAT 650 

CCTCCAGAAG ATCGACCGTC GTTCGGGTAA GGTTCTYGAA GAGAACCCCA 700 

AGTTCATCAA GTCGGGTGAC GCTGCCATGG TTGAGATGAT CCCGACCAA 749 



15 



30 



2) INFORMATION FOR SEQ ID NO: 442 



( i ) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea. filamentosa 

(B) STRAIN: ATCC 48174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 



ACTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CATGTCACCG TCATTGGTAC GTTCGACATG 100 

35 TTCGACCTTT TGCCTAGTGT CCCCTTCTAA CCACAGTTTA TAGACGCCCC 150 

TGGCCATCGT GATTTCGTCA AGAACATGAT CACTGGTACT TCCCAGGCTG 200 

ATTGCGCTAT CCTCATCATT GCTTCCGGCA CTGGTGAATT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTTT CCTTCACCCT 3 00 

CGGTGTTAGG CAGCTCATTG TCGCCCTCAA CAAGATGGAC ACTGTCAACT 350 

40 TCGCTGAGGC CCGTTACAAC GAGATTGTCA AGGAAGTCTC CAACTTCATC 400 

AAGAAGGTCG GCTACAACCC CAAGGCCGTT CCTTTCGTCC CCATCTCCGG 450 

TTTCGAGGGT GACAACATGA TCGAGGCCTC CACCCGCATT CCTTGGTACA 500 

AGGGCTGGAA CAAGGAGACC GCCAGTGGCA AGAGCACCGG CAAGACCCTY 550 

CTCGAGGCCA TTGATGCCAT CGAACCCCCG GTCCGTCCCA CCGACAAGCC 600 

45 CCTYCGTCTC CCTCTTCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 650 

TTCCTGTCGG TCGTGTTGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTT 700 

ACTTTCGCCC CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACCAGCAG CTCCAGGCCG GTAACCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCTCCGAC 850 

50 TCCAAGAACG ACCCCGCCAA GGGCTGCGAC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTTAAC CACCCCGGTC AGGTCGGTGC TGGATACGCT CCCGTCCTCG 950 

ATTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT TCTTGAGAAG 1000 

ATCGATCGCC GTACCGGTAA ATCCGTTGAG GACCACCCCA AGTTCATCAA 1050 

GTCTGGTGAT GCCGCTATCG TCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

55 TTGAGGCTTT CACTGACTAC CCTCCCCTTG GTCGTTTCGC CGTCCGTGAC 1150 



60 



2) INFORMATION FOR SEQ ID NO: 443 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 bases 

(B) TYPE; Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Metschnikowia pulcherrima 

(B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 



15 GGACAAGTTG AAGGCTGAGA GAGAGAGAGG TATCACCATC GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCT AAGTACCACG TCACCGTYAT TGACGCCCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAGGCTGA 150 

CTGTGCYATC TTGATTATCG CYGGTGGTGT TGGTGAGTTC GAGGCTGGTA 200 

TCTCCAAGGA TGGCCAGACC AGAGAGCACG CTTTGTTGGC YTACACCTTG 2 50 

20 GGTGTTAGAC ARTTGATTGT TGCCGTCAAC AAGATGGACT CTGTCAAGTG 3 00 

GGACAAGAAC AGATTTGAGG AGATCATCAA GGAGACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACYGTGC CATTCGTGCC AATYTCTGGT 4 00 

TGGAACGGTG ACAACATGAT TGAGGCYTCC ACTAACTGCC CATGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGTAA GTCTWCCGGT AAGACCTTGT 50 0 

25 TGGAGGCCAT TGACGCCATT GAGCCACCAA CCAGACCTAC CGACAAGGCC 550 

TTGAGATTGC CTTTGCAGGA TGTCTACAAG* ATCGGTGGTA TCGGAACGGT 600 

GCCAGTCGGC CGTGTCGAGA CCGGTGTCAT TAAGGCCGGT ATGGTTGTYA 650 

CCTTYGCCCC AGCTGGTGTC ACCACTGAGG TCAAGTCCGT CGAGATGCAC 700 

CACGAGCAGT TGGTCGAGGG TCTTCCAGGT GACAAYGTTG GTTTCAACGT 750 

30 CAAGAACGTC TCCGTTAAGG AGATCAGAAG AGGTAACGTC TGTGGTGACT 800 

CCAAGCAGGA CCCACCAAAG GGTGCCGCTT CTTTCACCGC YCAGGTTATT 850 

GTGTTGAACC ACCCTGGTCA GATCTCCTCT GGTTACTCTC CAGTGTTGGA 900 

CTGYCACACC GCCCACATTG CCTGTAARTT CGACACCTTG TTGGAGAAGA 950 

TTGACAGAAG AACTGGTAAG TCCTTGGAGT CYGAGCCTAA GTTCGTCAAG 1000 

35 TCYGGTGACG CCGCCATTGT CAAGATGGTG CCAACCAAGC CAATGTGTGT 1050 

TGAGGCTTTC ACCGACTACC CACCTTTGGG TAGATTCGCC GTCAGAGAC 1099 



40 2) INFORMATION FOR SEQ ID NO: 444 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1153 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



50 (vi) ORIGINAL SOURCE : 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 42570 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 

55 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGACT 100 

CGCGCGAGAC TGGTCGCAAT TTCCACGTCG CTAACGTGCT TGAACAGACG 150 

CTCCCGGCCA CCGTGACTTC ATCAAGAACA TGATCACTGG TACCTCCCAG 2 00 

60 GCTGACTGCG CTATCCTCAT TATCGCTGCC GGCACTGGTG AGTTCGAGGC 250 
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TGGTATCTCC AAGGATGGCC 
CCCTCGGTGT TAAGCAGCTC 
AAGTGGTCTG AGGCCCGTTT 
CATCAAGAAG GTCGGCTACA 
5 CTGGTTTCCA CGGCGACAAC 
TACAAGGGCT GGGAGAAGGA 
CCTCCTTGAG GCCATCGACT 
AGCCCCTCCG CCTTCCCCTT 
ACAGTCCCTG TCGGCCGTAT 

10 CGTGACCTTC GCTCCTTCCA 
TGCACCACGA GCAGCTCTCC 
AACGTCAAGA ACGTCTCCGT 
TGACTCCAAG AACGACCCCC 
TCATCGTCCT CAACCACCCC 

15 CTCGACTGCC ACACCGCCCA 
GAAGATCGAC CGCCGTACCG 
TCAAGTCTGG CGACTCTGCC 
TGCGTTGAGG CTTTCACCGA 
TGA 

20 



AGACCCGTGA GCACGCTCTG 
ATCGTCGCTA TCAACAAGAT 
CCAGGAGATC ATCAAGGAGA 
ACCCCAAGAC CGTCGCTTTC 
ATGCTTTCCC CCTCCACCAA 
GACCAAGGCT GGCAAGTCCA 
CCATCGAGCC CCCCAAGCGC 
CAGGATGTGT ACAAGATCGG 
CGAGACTGGT GTCATCAAGC 
ACGTCACCAC CGAAGTCAAG 
GAGGGTGTCC CCGGTGACAA 
CAAGGAGATC CGTCGTGGCA 
CTCTGGGTGC CGCTTCTTTC 
GGCCAGGTCG GTGCTGGCTA 
CATTGCCTGC AAGTTCGCCG 
GCAAGTCTGT CGAGTCCGCC 
ATCGTCAAGA TGATTCCCTC 
CTACCCTCCT CTGGGCCGCT 



PCT/CA00/01150 



CTCSCMKCA" 


5CKT 


GGACACCACC 


350 


CCTCCAACTT 


400 


GTCCCCATCT 


450 


CTGCCCCTGG 


500 


CCGGCAAGAC 


550 


CCCAGCGACA 


600 


CGGTATCGGC 


650 


CCGGCATGGT 


700 


TCCGTTGAGA 


750 


CGTCGGCTTC 


800 


ACGTCGCCGG 


850 


GATGCCCAGG 


900 


CGCCCCCGTC 


950 


AGATCAAGGA 


1000 


CCCAAGTTCA 


1050 


CAAGCCCATG 


1100 


TCGCCGTCCG 


1150 




1153 



2) INFORMATION FOR SEQ ID NO: 445 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
35 (B) STRAIN: ATCC 32071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445 

TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

40 CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 2 00 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 2 50 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AGATCTCCGG 3 00 

45 TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 3 50 

GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 4 00 

GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 4 50 

CGGCTTCAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 5 00 

TTGCCGGTGA CTCTAAGAAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

50 GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCATA CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 650 

TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 7 00 

AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTCCTTCCAA 750 

GCCCATGTGC GTC 7 63 

55 

2) INFORMATION FOR SEQ ID NO: 446 
60 (i) SEQUENCE CHARACTERISTICS: 

270 
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(A) LENGTH: 1346 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium mameffei 
10 (B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446 

AAGCTCAAGG CTGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

15 GAAGTTCCAG ACTGCCAAGT ACGAGGTTAC CGTCATTGAC GCCCCCGGTC 10 0 

ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA GGCCGATTGC 150 

GCTATTCTCA TCATTGCCTC TGGTACTGGT GAATTCGAGG CTGGTATCTC 2 00 

CAAGGATGGC CAGACTCGTG AGCACGCTCT TTTGGCTTTC ACCCTCGGTG 2 50 

TCCGTCAGCT CATTGTTGCC CTCAACAAGA TGGACACTTG CAAGTGGTCT 3 00 

2 0 CAGGGTGAGT ACTCGTACCT GCGTTTGGCC TTGAATATCT TACTAATGCA "3 50 

CCATAGATCG TTACAACGAA ATTGTCAAGG AGACTTCCAA CTTCATCAAG 400 

AAGGTCGGAT ACAACCCCAA GAACGTTCCT TTCGTTCCTA TCTCCGGTTT 450 

CAACGGTGAC AACATGCTTG AGCCCTCCCC CAACTGCCCC TGGTACAAGG 500 

GTTGGGAGAA GGAGACCAAG GCCGGTAAGG TCACTGGTAA GACCCTCCTC 550 

2 5 GAGGCCATCG ACGC CATTGA GCCCCCTACC CGTCCCGCCA ACAAGGTTAG 600 

TCCCTCCTCG ACTACTCAAA CCCTCCTCAT AAGTTCAGAT TACTGACTCG 650 

TTCACAGCCC CTCCGTCTTC CCCTCCAGRA TGTCTACAAG ATCGGTGGTA 700 

TTGGAACGGT TCCCGTCGGT CGTGTTGAGA CTGGTACCAT CGTTCCTGGT 7 50 

ATGGTTGTCA CCTTGTAAGT CACTCTCCTC GCTTATCCTA CCTGAAATCA 800 

30 TCATGTGCTA ACTTGACACT CAGCGCTCCC GCCAACGTCA CCACTGAAGT 850 

CAAGAGTGTT GAAATGCACC ACCAGCAGCT CACTGCCGGT CAGCCCGGTG 900 

ACAACGTTGG TTTCAACGTG AAGAACGTCT CCGTCAAGGA AATCCGTCGT 950 

GGTAACGTTG CTGGTGACAG CAAGAACGAC CCCCCTGCCG GTGCTGCCTC 1000 

CTTCAACGCC CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCTG 1050 

3 5 GTTACGCCCC AGTCCTCGAT TGCCACACTG CCCACATTGC TTGCAAGTTC 1100 

GCTGAGCTCC TCGAGAAGAT TGACCGTCGT ACCGGAAAGT CTGTTGAGGA 1150 

CCACCCCAAG TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC 12 00 

CTTCCAAGCC CATGTGTGTT GAGGCTTTCA CCGAGTACCC TCCTCTCGGT 12 50 

CGTTTCGCCG TTCGCGAGTA AGTTTTATCT CCGTTGTCTA TTTTCCATCC 13 00 

40 TTCCCTTCTC CTCCGTCTTC CATATATACT TTTTCAGTTA TATGTG 1346 



45 



2) INFORMATION FOR SEQ ID NO: 447 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Plchia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447 

60 AAGTTAAAAG CTGAACGTGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 
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GAAATTCGAA ACTCCAAAAT ACCATGTTAC CGTTATTGAT GCTCCAGGTC 100 
ACAGAGATTT CATCAAAAAT ATGATTACTG GTACTTCCCA AGCTGATTGT 150 
GCTATYTTAA TTATTGCCGG TGGTATTGGT GAATTCGAAG CTGGTATCTC 2 00 

AAAAGATGGT CAAACCAGAG AACACGCTTT NTTAGCTTAC ACCTTAGGTG 250 
5 TTAAACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAAATGGGAT 3 00 

GAATCTCGTT TCGAAGAAAT TGTCAAGGAA ACYTCAAACT TTATCAAGAA 3 50 

AGTTGGTTAC AACCCAAAAA CTGTTCCATT CGTTCCAATC TCAGGTTGGA 4 00 

ATGGTGATAA CATGATTGAA CCATCAWCTA ACTGTCCATG GTACAAAGGT 450 
TGGAAAAAAG AAACCAAAGC TGGTGAAGCT AAAGGTAAAA CTTTATTAGA 500 

10 AGCCATTGAT GCTATTGACC CACCATCAAG ACCAACTGAT AAACCATTAC 550 
GTTTACCATT ACAAGATGTT TACAARATTG GTGGTATTGG TACTGTGCCA 600 
GTCGGTAGAG TTGAAACCGG TGTTATCAAA CCAGGTATGG TTGTTACCTT 650 
TGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCTGTTGAA ATGCATCATG 700 
AACAATTGAC TGAAGGTTTA CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750 

15 AATGTTTCTG TTAAAGAAAT CCGTCGTGGT AACGTCTGTG GTGACTCTAA 800 
AAACGATCCA CCAAAAGCTG CTGAATCATT CAATGCTCAA GTTATTGTCT 850 
TAAACCATCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT TTTAGATTGT 900 
CACACTGCTC ACATTGCTTG TAAATTTGAC ACTTTAATTG AAAAAATTGA 950 

CAGACGTACT GGTAAGAAAT T AG AAGAAG C TCCAAAATTC ATCAAATCAG 1000 

2 0 GTGATGCTGC TATTGTTAAA TTTGTT C CAT CAAAACCATT ATCAGTTGAA 1050 

GCTTTCACTG ACTACCCACC ATTAGGTCGT TTCGCTGTCA GAGA 1094 



25 2) INFORMATION FOR SEQ ID NO: 448 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 1100 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 



CTTAGATAAG TTAAAAGCTG AACGTGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAATACC ATGTTACCGT TATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAAAATATG ATTACTGGTA CTTCCCAAGC 150 

TGATTGTGCT ATMTTAATTA TTGC CGGTGG TATTGGTGAA TTCGAAGCTG 2 00 

45 GTATCTCAAA AGATGGTCAA ACCAGAGAAC ACGCTTTRTT AGCTTACACC 250 

TTAGGTGTTA AACAATTGAT TGTTGCTATC AACAAGATGG ATTCCGTTAA 3 00 

ATGGGATGAA TCTCGTTTCG AAGAAATTGT CAAGGAAACY TCAAACTTTA 350 

TCAAGAAAGT TGGTACAACC CAAAAACTGT TCCATTCGTT CCAATCTCAG 400 

GTTGGAATGG TGATAACATG ATTGAACCAT CAACTAACTG TCCATGGTAC 4 50 

50 AAAGGTTGGA AAAAAGAAAC CAAAGCTGGT GAAGCTAAAG GTAAAACTTT 500 

ATTAGAAGCC ATTGATGCTA TTGATCCACC ATCAAGACCA ACTGATAAAC 550 

CATTACGTTT ACCATTACAA GATGTTTACA ARATTGGTGG TATTGGTACT 6 00 

GTGCCAGTCG GTAGAGTTGA AACCGGTGTT ATCAAACCAG GTATGGTTGT 650 

TACCTTTGCC CCAGCTGGTG TTACCACTGA AGTCAAATCT GTTGAAATGC 700 

55 AT C ATGAACA ATTGACTGAA GGTTTACCAG GTGACAATGT TGGTTTCAAC 7 50 

GTTAAGAATG TTTCTGTTAA AGAAATCCGT CGTGGTAACG TCTGTGGTGA 800 

CTCTAAAAAC GATCCACCAA AAGCTGCTGA ATCATTCAAT GCTCAAGTTA 850 

TTGTCTTAAA CCATCCAGGT CAAATCTCTG CTGGTTACTC TCCAGTTTTA 900 

GATTGTCACA CTGCTCACAT TGCTTGTAAA TTTGACACTT TAATTGAAAA 950 

60 AATTGACAGA CGTACTGGTA AGAAATTAGA AGAAGCTCCA AAATTCATCA 1000 
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AATCAGGTGA TGCTGCTATT GTTAAATTTG TTCCATCAAA ACCATTATCA ^ 1050 
GTTGAAGCTT TCACTGACTA CCCACCATTA GGTCGTTTCG CTGTCAGAGA 1100 



2) INFORMATION FOR SEQ ID NO: 449 

( i ) SEQUENCE CHARACTER ISTICS : 
(A) LENGTH: 1201 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCGA GACCCCCAAG TACCAGGTCA CCGTCATTGG TATGTCTTTG 10 0 

TGCTTTTTGT GCTTTTTGTG CCTGTGCCTC GCACAATTCC AGCCCTCGAT 150 

25 AATTATGAAC CTCGTACTAA TATGTCGTTC TCCCACTACC CACAGACGCC 200 

CCCGGCCATC GTGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC 250 

TGATTGCGCC ATTCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 300 

GCATCTCCAA GGATGGCCAG ACCCGTGAGC ACGCTCTTCT CGCCTTCACC 3 50 

CTCGGTGTCA AGAACCTCAT TGTTGCCATC AACAAGATGG ACACCAACAA 4 00 

3 0 CTGGTCCGAG GACCGATACA AGGAGATCAT CAAGGAGACC TCCAACTTCA 450 

TCAAGAAGGT CGGCTACAAC CCCAAGGCCG TTCCTTTCGT CCCCATCTCC 500 

GGTTTCCACG GAGACAACAT GCTTACCCCC TCCACCAACT GCCCCTGGTA 550 

CAAGGGTTGG GTCCGTGAGG TCAAGGGTAA CACCCTTACC GGCAAGACCC 600 

TTCTCGAGGC CATCGACTYC ATCGAGCCCC CCAAGCGTCC CACCGAGAAG 650 

35 CCCCTCCGTC TTCCCCTTCA GGACGTCTAC AAGATCGGTG GTATTGGCAC 700 

TGTGCCCGTC GGCCGTATCG AGACCGGTAT CCTCAAGCCC GGTATGGTCG 750 

TCACCTTCGC TCCCTCCAAC GTCACCACTG AAGTCAAGTC CGTCGAGATG 8 00 

CACCACGAGC AGCTTACCGA GGGTGTCCCC GGTGACAACG TTGGTTTCAA 850 

CGTGAAGAAC GTCTCCGTCA AGGATATCCG CCGTGGCAAC GTCTGCGGTG 900 

40 ACTCCAAGAA CGACCCCCCC GCTGCTGCCG CCTCTTTCCA GGCCCAGGTC 950 

ATTGTCCTCA ACCACCCCGG CCAGATCGGT GCTGGTTACG CTCCCGTTCT 1000 

TGACTGCCAC ACTGCCCACA TTGCTTGCAA GTTCGCCGAG CTCCTTGAGA 1050 

AGATCGACCG CCGTACCGGT AAGTCGGTCG AGAACAACCC CAAGTTCGTC 1100 

AAGTCTGGTG ATGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1150 

45 TGTTGAGTCC TTCACCGAGT ACCCCCCTCT CGGTCGTTTC GCCGTCCGTG 1200 

A 1201 



50 2) INFORMATION FOR SEQ ID NO: 450 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE : 
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PCT/CA00/01150 



(A) ORGANISM: Rhizopus oryzae 

(B) STRAIN: ATCC 56015 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 450 

5 

CAAGCTTAAG GCTGAACGTG AACGTGGTAT CACCATCGAT ATTGCTCTCT 50 

GGAAGTTCGA AACCCCCAAG TACCAAATTA CCGTTATTGA TGCTCCCGGT 100 

CACCGTGATT TCATCAAGAA CATGATTACC GGTACTTCTC AAGCCGATTG 150 

TGCTATTCTT ATCATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

10. CCAAGGATGG TCAAACCCGT GAACACGCCC TTTTGGCTTT CACTCTCGGT 250 

GTCCGTCAAT TGATTGTTGC TGTCAACAAG ATGGATACCA CCAAGTGGTC 3 00 

CGAAGCTCGT TTCAACGAAA TYGTCAAGGA AGTTTCTTCC TTCATCAAGA 350 

AGATTGGTTA CAACCCCAAG TCTGTTCCCT TCGTCCCCAT CTCTGGTTGG 400 

CACGGTGACA ACATGTTGGA AGAATCTACC AACATGCCCT GGTACAAGGG 450 

15 ATGGAACAAG GAAACCAAGG CTGGTGCCAA GTCTGGTAAG ACCCTCTTGG 500 

ATGCCATTGA CAACATTGAC CCTCCTACCC GTCCTGTTGA CAAGCCTCTC 550 

CGTCTTCCTC TTCAAGATGT TTACAAGATT GGTGGTATCG GTACTGTCCC 600 

CGTCGGTCGT GTCGAAACTG GTGTCATCAA GGCTGGTATG GTTGTCACCT 650 

TCGCTCCTGC TGCTGTCACC ACTGAAGTTA AGTCCGTCGA AATGCACCAC 700 

20 GAAACCCTCA CTGAAGGTCT CCCCGGTGAC AACGTCGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT TCTGACTCCA 800 

AGAACGACCC CGCCAAGGAA GCCGGTTCCT TCACCGCTCA AGTCATTATC 850 

TTGAACCACC CTGGTCAAAT TGGTGCTGGT TACGCTCCYG TTTTGGATTG 90 0 

TCACACTGCT CACATTGCCT GTAAGTTCGC TGAATTGATC GAAAAGATTG 950 

25 ACAGACGTTC CGGTAAGTCC TTGGAAGCTA CTCCCAAGTT CGTCAAGTCT 1000 

GGTGACTCTG CCATCGTCAA GATGATCCCC TCCAAGCCCA TGTGTGTTGA 1050 

AGC TT AC ACT GACTACCCTC CTCTCGGTCG TTTCGCTGTT CGTGA 1095 



30 



40 



2) INFORMATION FOR SEQ ID NO: 451 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1092 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotoz~ula minuta 

(B) STRAIN: ATCC 10658 

45 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 

GCTGAAGGCC GAGCGAGAGC GTGGTATCAC CATCGATATC GCTCTATGGA 50 

AGTTCGAGAC CCCCAAGTAC AACGTCACCG TCATTGACGC TCCAGGACAT 10 0 

CGTGATTTCA TCAAGAGTGA GTTAACCATA ACATCAAACA GTGTTGCAAA 150 

50 CATCAGCTAA TGCATGTTAT GCGTCCAGAC ATGATTACTG GTACTTCCCA 200 

GGC CGATTGC GCTATTCTCA TCATCGCCAC CGGTGTTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACCCGAG AGCACGCCCT TCTCGCCTTC 30 0 

ACCCTCGGTG TCAGACAGCT CATCGTTGCC TTGAACAAGA TGGACTCGGT 350 

CAAGGTAGGC TAACTTCACA ACGTCGGCTT CCCATCATTC ATTCACTTAC 4 00 

55 CTGTCTTGTC TTCCACCCTC CAGTTCTCCG AGTCCCGATA CGATGAAATC 450 

GTCAAGGAGA CATCCGGTTT CATCAAGAAG GTCGGATTCG ACCCCAAGGG 500 

TGTTCCCTTC GTCCCCATCT CAGGATGGCA CGGAGACAAC ATGATCGAGG 550 

AGTCCACCAA CATGCCTTGG TACAAGGGAT GGAAGAAGAC CACCAAGACC 600 

GGCGAGTACA AGGGAAAGAC CCTGCTCGAG GCCATCGACT CCATCGAGCC 650 

60 CCCCACCCGT CCTACCGACA AGCCTCTCCG ACTTCCCCTC CAGGATGTCT 700 
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ACAAGATTGG TGGTATCGGA ACAGTGCCAG TCGGACGAGT CGAGACTGGT 750 

ACCATCAAGG CTGGTATGGT CGTCACCTTC GCTCCTTCAG CTGTCACCAC 800 

CGAAGTCAAG TCTGTTGAGA TGCACCACGA GCAGCTCGAG GCTGGTCTTC 850 

CAGGTGACAA CGTCGGATTC AACATCAAGA ACGTTTCAGT CAAGGATATC 900 

5 CGAAGAGGAA ACGTCTGCGG TGACTCCAAG AACGATCCCC CCAAGGAGGC 950 

TGCTTCCTTC ACCGCCCAGG TCATTGTCCT CAACCACCCC GGTCAAATCG 1000 

GTAACGGATA CTCTCCAGTT CTCGATTGCC ACACTGGTGA GTCATTCTTC 1050 

CATATTAGTT TGAACTCTTT TGAACAATAC TAACGTGAAT CATTATACTT 1100 

TTCAGCCCAC ATTGCATGCA AGTTCGACAC CCTCCTAGAG AAGATTGACC 1150 

10 GACGATCCGG AAAGTCCATC GAAGATACCC CCAAGTTCGT CAAGTCTGGT 12 00 

GACGCCGCCA TCGTCAAGAT GGTCCCCACC AAGCCAATGT GCGTTGAGGC 1250 

TTTCACCGAC TACCCACCTC TTGGACGATT CGCCGTCCGT GA 12 92 



15 



25 



2 ) INFORMATION FOR SEQ ID NO: 452 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 128 9 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmani color 

(B) STRAIN: ATCC 32311 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 

AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCTCTCTG 50 

GAAGTTCGAG ACCCCCAAGG TGCGTTCTCA CCCCGGCTGA GGAGCACGCA 100 

CGCGAGGGCT CACGCTGCGC CTCTTACAGT ACATGATCAC CGTCATCGAC 150 

35 GCCCCGGGTC ACCGTGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA 2 00 

GGCCGACTGC GCCATCCTCA TCATCGCCGC CGGTACCGGT GAGTTCGAGG 250 

CTGGTATCTC GAAGGACGGC CAGACCCGCG AGCACGCCCT CCTCGCCTTC 300 

ACCCTCGGTG TCCGTCAGCT CATCGTCGCC ATCAACAAGA TGGACACGAC 350 

CAAGTACTCG GAGGCCCGTT TCGAGGAGAT CATCAAGGAG ACCTCCAACT 4 00 

40 TCATCAAGAA GGTCGGCTTC AACCCCAAGA ACGTCCCCTT CGTCCCCATC 450 

TCGGGATGGC ACGGTGACAA CATGATTGAG GAGACCGCCA ACATGCCCTG 500 

GTACAAGGGA TGGAAGAAGG AGACCAAGGC CGGTGAGGTC AAGGGCAAGA 550 

CCCTCCTCGA CGCCATCGAC GCGATCGAGC CCCCTTCGCG CCCTACCGAC 600 

AAGCCCCTCC GTCTTCCCCT CCAGGTTCGT TTCCCTGCTC GCGGTTTACG 650 

45 CTGCTACTTC GAGCTGACCC GCGAGCTCTG CCCGAACAGG ATGTCTACAA 700 
GATCGGTGGT ATCGGCACAG TCCCCGTCGG CCGTGTCGAG ACCGGCACGA . 750 

TCAAGGCCGG TATGGTCGTC GTCTTCGCCC CGGCCAACGT CACCACTGAG 800 

GTCAAGTCGG TCGAGATGCA CCACGAGCAG CTCGAGGCTG GTCTCCCGGG 850 

AGACAACGTC GGCTTCAAGT GCGTCTCATC ATGTTTTTGC TTCGCTCGGC 900 

50 CATTTTTTCA GTCCTGACCC CGTTTTGCCC CTCGACAGCG TCAAGAACGT 950 

TTCCGTTAAG GACATCCGTC GCGGTAACGT CTGCGGTGAC TCGAAGAACG 1000 

ACCCCCCCAA GGAGGCCGCT TCCTTCAAGG CCCAGGTCAT CGTCATGAAC 1050 

CACCCCGGTC AGATCGGCAA CGGTTACGCT CCCGTTCTCG ACTGCCACAC 1100 

CGCCCACATT GCCTGCAAGT TCGACACCCT CCTCGAGAAG ATCGACCGTC 1150 

55 GCTCGGGCAA GTCGATTGAG GACCTCCCCA AGTTCGTCAA GTCGGGTGAC 1200 

GCCGCCATCG TCAAGATGGT TCCCTCCAAG CCGATGTGTG TCGAGTCGTT 1250 

CGCCGAGTAC CCCCCTCTCG GACGTTTCGC CGTCCGTGA 1289 



60 
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2) INFORMATION FOR SEQ ID NO: 4 53 



PCT/CA00/01150 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 1070 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-148 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453 

GTGAGCGCGG TATCACCATC GATATTGCTC TGTGGAAGTT CGAGACCCCC 50 

AAGTACTACG TCACCGTCAT TGACGCCCCC GGTCATCGCG ATTTCATCAA 100 

GAACATGATC ACTGGTACCT CGCAGGCCGA CTGCGCCATT CTCATCATTG 150 

2 0 CCGCTGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACT 200 

CGTGAGCACG CTCTGCTCGC CTACACCCTG GGTGTGCGGC AGCTGATCGT 250 

CGCCATCAAC AAGATGGACA CGGCCAAGTG GGCTGAGGCT CGTTACCAGG 300 

AGATCATCAA GGAGACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 350 

AAGACTGTTG CCTTCGTCCC CATCTCGGGC TTCCACGGCG ACAACATGCT 400 

25 TACTCCCTCG ACCAACTGCC CCTGGTACAA GGGCTGGGAG AAGGAGGGCA 4 50 

AGAGCGGCAA GGTTACCGGT AAGACTCTGC TGGACGCCAT TGACGCCGTC 500 

GAGCCCCCCA AGCGCCCCAC GGACAAGCCC CTGCGTCTGC CCCTCCAGGA 550 

TGTCTACAAG ATCGGCGGTA TCGGCACTGT . CCCTGTCGGC CGTATCGAGA 600 

CTGGTGTCCT GAAGCCCGGC ATGGTCGTCA CCTTTGCCCC GTCCAACGTC 650 

30 ACCACTGAAG TCAAGTCCGT CGAGATGCAC CACGAGCAGC TTGTTGAGGG 700 

TGTTCCCGGC GACAACGTCG GCTTCAACGT CAAGAACGTC TCCGTCAAGG 750 

AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA CCCCCCCTCG 800 

GGCGCCGCCA CCTTCAACGC CCAGGTCATT GTCCTGAACC ACCCCGGCCA 850 

GGTCGGCAAC GGCTACGCCC CGGTTCTGGA CTGCCACACC GCCCACATTG 900 

35 CCTGCAAGTT CACCGAGATC CTTGAGAAGA TCGACCGCCG TACCGGCAAG 950 

TCGGTTGAGA ACAACCCCAA GTTCATCAAG TCGGGTGACG CCGCCATTGT 1000 

CAAGCTGACG CCCTCGAAGC CCATGTGCGT TGAGGCCTTC ACTGACTACC 1050 

CCCCTCTGGG CCGTTTCGCC 1070 



40 



2 ) INFORMATION FOR SEQ ID NO: 454 

( i ) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 1092 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephem.oa.scus ciferrii 

(B) STRAIN: ATCC 52550 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454 



CTTAAGTCTG AGCGTGAGAG AGGTATCACC ATCGATATTG CTCTCTGGAA 50 

ATTCGAGACT CCTAAGTACA ACGTTACCGT CATTGATGCT CCAGGTCACA 100 

60 GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC CGATCTTGCC 150 

276 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/O1 150 

ATCCTTATCA TTGCTGGTGG TGTCGGTGAG TTCGAGGCTG GTATCTCCAA " 2 0tf 
GGACGGTCAG ACCAGAGAGC ACGCTCTTCT TTCTTTCACC CTTGGTGTCA 2 50 

GAAACATGAT TGTTGCTGTC AACAAGATGG ACTCCGTCAA GTGGTCTGAG 3 00 

GATCGTTTCA ACGAAATTGT CAAGGAGACC TCCAACTTCG TCAAGAAGGT 3 50 

5 TGGTTACAAC CCTAAGAATA TTGCTTTCGT TCCTATCTCC GGTTGGAACG 400 
GTGACAATAT GATTGAGC C A TCCACCAACT GCCCATGGTA CAAGGGTTGG 4 50 

GAGCGTGAGA CCAAGAACGG TACTGCCAAG GGTAAGACCA TCTTGGAGGC 500 
CATTGACTCT ATGGAGCCAC CTTCCAGACC AGTTGACAAG CCTCTCCGTC 550 
TTCCTCTTCA GGACGTTTAC AAGATTGGTG GTATTGGTAC GGTGCCAGTT 600 

10 GGTCGTGTTG AGACTGGTGT TATCAAGCCA GGTATGGTTG TTACCTTTGC 650 
CCCAGCTGGT GTCACCACTG AAGTCAAGTC TGTCGAGATG CACCACGAAC 700 
AGATCCCAGA AGGTACCCCA GGTGACAACG TTGGTTTCAA CGTCAAGAAC 750 
GTCTCCGTCA AGGAAATCAG ACGTGGTAAC GTTACTGGTG ACTCCAAGAA 800 
CGACCCACCA AAGGGCTGCG ACTCTTTCAA CGCTCAGGTC ATCATCTTCA 8 50 

15 ACCACCCTGG TCAGATCTCT GCTGGTTACG CTCCAGTTTT GGACTGCCAC 900 
ACTGCTCACA TTGCTTGCAA GTTTGAGGAG CTCATTGAGA AGATTGACAG 950 

ACGTTCCGGT AAGAAGGTCG AAGACTCTCC TAAGTTCGTC AAGGCCGGTG 1000 

ATGCCGCCAT TGTCAAGATG GTTCCATCCA AGCCAATGTG TGTTGAAACC 1050 

TTCACTGAGT ACCCACCTCT TGGTCGTTTC GCCGTCCGTG AC 1092 

20 



2) INFORMATION FOR SEQ ID NO: 455 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1149 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton menta.grophyt.es 
35 (B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455 

GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT GGAAGTTCGA 50 

40 GACCCCCAAG TACAATGTCA CCGTCATTGG TATGTTTTCT CTTTACCTTT 100 

CCCCTCCATC GTCTTGCTGT GCCATAACTA ACGAGAGTAG ACGCCCCCGG 150 

TCACCGTGAC TTCATCAAGA ACATGATCAC TGGTACCTCC CAGGCCGACT 2 00 

GTGCTATTCT CATCATTGCT GCCGGTACTG GTGAGTTCGA GGCTGGTATC 250 

TCCAAGGATG GCCAGACCCG TGAGCACGCT CTGCTCGCCT TCACCCTTGG 300 

4 5 TGTCAAGCAG CTCAT CGTTG CCATCAACAA GATGGACACC ACCAACTGGT 3 50 

CCGAGGACCG TTTCAAGGAA ATCATCAAGG AAGTCACCAA CTTCATCAAG 400 

AAGGTTGGCT ACGACCCCAA GGGTGTTCCA TTCGTTCCAA TCTCTGGTTT 450 

CAACGGTGAC AACATGATTG AGGCCTCCAC CAACTGCCCA TGGTACAAGG 500 

GATGGAACAA GGAGACTAAG GCCGGTGGTG CCAAGACTGG CAAGACCCTC 550 

50 CTCGAGGCCA TCGATGCCAT CGACATGCCA ACCCGTCCTA CCGACAAGCC 600 

CCTCCGTCTC CCACTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACTG 650 

TACCAGTCGG TCGTGTTGAG ACCGGTATCA TCAAGCCCGG TATGGTCGTC 700 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TCGAAATGCA 750 

CCACCAGCAG CTTCAGCAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

55 TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGTAACGT TGCCGGTGAC 850 

TCCAAGAACG ACCCACCATC CGGCTGTGCC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGATCGGTGC TGGTTACGCC CCAGTCCTCG 950 

ACTGCCACAC TGCTCACATT GCTTGCAAGT TCGCTGAGCT CCTCGAGAAG 1000 

ATTGACCGCC GTACCGGTAA ATCCGTCGAA GCCAACCCCA AGTTCGTCAA 1050 

60 GTCTGGTGAT GCCGCTATCG CCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

277 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
TTGAGGCTTT CACTGACTAC CCCCCACTTG GTCGTTTCGC~ CGTCCGTGA 1 fl^9 



5 2") INFORMATION FOR SEQ ID NO: 456 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 



20 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 



TCTTGACAAG CTTAAAGCTG AACGTGAACG TGGTATCACC ATTGATATCG 50 
CTCTCTGGAA GTTCGAAACT CCTAAGTACT ACGTTACTGT TATTGATGCT 100 
CCAGGTCACC GTGATTTCAT CAAGAACATG ATTACTGGTA CTTCCCAAGC 150 
CGACTGCGCC ATTCTTATCA TTGCTGCCGG TGTCGGTGAA TTCGAAGCTG 2 00 

2 5 GTATCTCCAA GGAAGGTCAA ACCAGAGAAC ACGCTCTTCT CGCTTTCACC 2 50 

CTTGGTGTCA GACAACTTAT CATTGCCATC AACAAGATGG ACTCTGTCAA 300 
GTGGGACCAA AAGAGATACG AAGAAATCGT CAAGGAGGCT TCCAACTTCG 350 
TCAAGAAGGT TGGTTACAAC CCCAAGTCTG TTCCATTCGT TCCTATCTCT 4 00 

GGTTGGAACG GTGACAACAT GTTGGAACCT ACCACCAACG CCCCATGGTA 450 

3 0 CAAGGGATGG AC CAAGG AAA CCAAGGCTGG TGCCACTAAG GGTATGACTC 50 0 

TTATTGAAGC CATTGACGCC ATTGAACCAC CAGTAAGACC ATCCGACAAG 550 
CCACTCCGTC TCCCACTCCA AGATGTTTAC AAGATTGGTG GTATCGGAAC 600 
TGTGCCAGTC GGCCGTGTCG AAAC CGGTAT CATCAAGGCC GGTATGGTCG 650 
TCACCTTTGC TCCACCAATG GTCACAACTG AAGTTAAGTC CGTTGAAATG 700 

35 CACCACGAAC AACTTGCTCA AGGTAACCCA GGTGACAACG TTGGTTTCAA 750 
CGTCAAGAAC GTTTCCGTTA AGGAAATCAG ACGTGGTAAC GTCTGTGGTG 800 
ACTCCAAGAA CGATCCACCA AAGGGCTGCG AATCTTTCAA CGCTCAAGTT 850 
ATCGTCTTGA ACCACCCTGG TCAAATCTCT GCTGGTTACT CTCCAGTTCT 900 
CGATTGCCAC ACTGCCCACA TTGCCTGCAG ATTCGACGAA CTCCTTGAAA 950 

40 AGATCGACCG TCGTTCCGGT AAGAAGATTG AAGACTCTCC AAAGTTTGTC 1000 

AAGTCTGGTG ATGCCGCTAT CGTCAAGATG ATCCCAACCA AGCCAATGTG 1050 

CGTTGAAACC TTCACTGAAT ACCCACCACT TGGTCGTTTC GCCGTCCGTG 1100 

A 1101 



2) INFORMATION FOR SEQ ID NO: 457 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 

60 

278 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 

GTTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC TATGTCACCG TCATCGACGC CCCGGGTCAT 100 

5 CGTGACTTTA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATCTTGATC ATTGCCGCCG GT AC CGGTGA ATTCGAAGCC GGTATCTCCA 2 00 

AGGATGGTCA GACCCGTGAG CACGCTCTGC TCGCCTACAC CTTGGGTGTC 250 

AAGCAGCTCA TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 300 

GGAGCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

10 TCGGCTACAA CCCCAAGGCC GTTCCTTTCG TCCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATTGAGGT CTCCACCAAC TGCCCGTGGT ACAAGGGATG 450 

GGAGAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTCGAGG 500 

CCATTGACGC CATCGACCCA CCCACCCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCTCTCC AGGATGTCTA CAAGATCTCT GGTATCGGAA CGGTTCCTGT 600 

15 CGGTCGTGTC GAGACCGGTA CCATCAAGGC CGGTATGGTC GTCACCTTCG 650 

CTCCGGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAG 700 

CAGCTCGCCG AGGGTCTGCC AGGTGACAAC GTTGGCTTCA ACGTCAAGAA 750 

CGTCTCCGTC AAGGAGGTTC GTCGTGGTAA CGTTGCCGGT GACTCCAAGA 8 00 

ACGACCCGCC CAAGGGTGCC GAGTCCTTCA ACGCCCAGGT CATTGTCCTC 850 

20 AACCACCCTG GTCAGATCGG TGCCGGCTAC GCTCCAGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCTTGCA AGTTCGCCGA GTTGCTCGAG AAGATCGACC 950 

GTCGTACCGG AAAGTCCATC GAGAACAACC CCAAGTTCAT CAAGTCTGGT 10 0 0 

GATGCTGCCA TCGTCAAGAT GATTCCCAGC AAGCCCATGT GTGTCGAGGC 1050 

TTTCACCGAC TATCCTCCTC TGGGTCGTTT CGCTG 1085 

25 



2) INFORMATION FOR SEQ ID NO: 458 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
40 (B) STRAIN: DAL-95 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458 

TGTCTTCATC CGGAATTGAT TGTGAGTCGT TCCACATGCT CACCTAGTTT 50 

45 TCGCTCGATC TTTTCACTAA CGCAAACCAT GTAGAACAAC ATTGCCAAGG 100 

CCCACGGTGG TTACTCCGTC TTCACTGGTG TTGGTGAGCG TACTCGTGAG 150 

GGTAACGATC TGTACCACGA AATGCAGGAG ACTGGTGTCA TTCAGCTCGA 2 00 

GGGTGAATCC AAGGTCGCAC TGGTGTTCGG ACAGATGAAC GAGCCCCCCG 250 

GTGCCCGTGC CCGTGTCGCC CTTACCGGTC TGACCATTGC CGAGTACTTC 3 00 

50 CGTGACGAGG AGGGTCAGGA CGTGCTGCTC TTCATTGACA ACATTTTCCG 3 50 

TTTCACCCAG GCCGGTTCTG AGGTGTCTGC CCTTCTCGGT CGTATCCCCT 4 00 

CTGCCGTCGG TTACCAGCCC ACCCTGGCCG TCGACATGGG TGGTATGCAG 450 

GAGCGTATCA CCACCACCAA GAAGGGTTCT ATTACCTCCG TC 492 

55 

2 ) INFORMATION FOR SEQ ID NO: 459 

( i ) SEQUENCE CHARACTERI STICS : 
60 (A) LENGTH: 1154 bases 

279 



SUBSTITUTE SHEET (RULE 26} 



WO 01/23604 



PCT/CA00/01150 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459 



GTCCGTGGTC AAGAAGTTAT TGACACTGGT GCCCCAATTA CCATTCCTGT 50 

TGGTCGTGGT ACTCTTGGTA GAATTATCAA CGTCATTGGT GAACCAATTG 100 

15 ACGAACGTGG TCCTATCAAG GCTTCTAAGT ATGCTCCTAT CCATACTGAA 150 

CCACCAACCT TTGCTGAACA ATCTACTTCT GCTGAAGTTC TTGAAACTGG 2 00 

TATCAAGGTT GTCGATCTTC TTGCTCCTTA CGCCCGTGGT GGTAAGATTG 2 50 

GTCTTTTCGG TGGTGCTGGT GTCGGTAAGA CTGTCTTCAT TCAAGAACTT 3 00 

ATTAACAACA TTGCCAAGGC TCACGGTGGT TTCTCTGTCT TCACTGGTGT 350 

20 CGGTGAAAGA ACCCGTGAAG GTAACGATCT TTACCGTGAA ATGAAGGAAA 4 00 

CTGGTGTCAT CAACCTCGAA GGTGACTCCA AGGTCGCTCT CGTTTTCGGT 450 

CAAATGAACG AACCTCCAGG TGCCCGTGCC CGTGTCGCTT TGACTGGTCT 500 

TACCATTGCC GAATACTTCC GTGATGAAGA AGGACAAGAT GTCTTGCTTT 550 

TCGTTGACAA CATTTTCAGA TTCACCCAAG CCGGTTCTGA AGTCTCTGCT 600 

25 CTTTTGGGTC GTATTCCATC TGCCGTCGGT TACCAACCTA CCCTTGCTAC 650 

CGATATGGGT GCCCTCCAAG AACGTATTAC CACCACCCAA AAGGGTTCCG 700 

TCACATCTGT CCAAGCCGTC TATGTCCCAG CAGACGATTT GACTGATCCT 750 

GCCCCAGCCA CCACTTTCGC TCACTTGGAC GCCACCACTG TCTTGTCTCG 800 

TTCCATTTCC GAATTGGGTA TCTACCCAGC TGTCGATCCT CTCGATTCCA 850 

30 AGTCTCGTCT TTTGGATCCT GAAGTTATTG GACACGAACA CTACGAAGTT 900 

GCCACTCAAG TTCAACAAAC CCTCCAAGCT TACAAGTCTC TCCAAGATAT 950 

CATTGCCATT TTGGGTATGG ATGAATTGTC TGAAGCTGAT AAGCTTACTG 1000 

TCGAACGTGC CCGTAAGATC CAAAGATTCC TTTCCCAACC ATTCGCTGTT 1050 

GCCGAAGTTT TCACTGGTAT CGAAGGTCGT CTCGTTCCAT TGAAGGAAAC 1100 

35 CGTCAGATCT TTCAAGGAAA TCCTTGAAGG TAAGTACGAT CACCTTCCAG 1150 

AAGC 1154 



40 2) INFORMATION FOR SEQ ID NO: 460 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 



CCAATTCGAC GAAGGAAACT TGCCAGCTAT TTTGAATGCT TTGACTTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCTCAACA TTTGGGTGAA 100 

AACACCGTCA GAGCTATTGC TATGGATGGT ACTGAAGGTT TAGTCAGAGG 150 

TACCGAAGTC AACGATAC CG GTGCCCCAAT CTCCGTCCCA GTCGGTAGAG 2 00 

60 GTACTTTAGG TAGAATCATC AATGTTGTTG GTGAACCAAT TGATGACAGA 250 

280 



SUBSTITUTE SHEET (RULE 26) 
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GGTCCAATTG 
ATTCGTTGAA 
TTGTCGACTT 
GGTGGTGCTG 
5 CATTGCCAAA 
GAACCAGAGA 
ATCAACTTGG 
CGAACCACCA 
CTGAATACTT 

10 AACATTTTCA 
TCGTATTCCA 
GTCTTTTGCA 
GTCCAAGCTG 
CACTACATTC 

15 CTGAATTGGG 
TTATTGGACG 
TGTTCAACAA 
TTTTGGGTAT 
GCCCGTAAGA 

20 TTTCACTGGT 
CATTCAAGGA 



AATGTAAGGA 
CAATCCACTG 
GTTGGCCCCA 
GTGTCGGTAA 
GCCCATGGTG 
AGGTAACGAT 
AAGGTGACTC 
GGTGCTAGAG 
CAGAGATGAA 
GATTCACCCA 
TCTGCTGTCG 
AGAACGTATT 
TCTATGTCCC 
GCCCATTTGG 
TATCTACCCA 
CTTCTGTTGT 
ACTTTACAAG 
GGATGAATTG 
TCCAAAGATT 
ATCCCAGGTA 
TGTTTTGGAA 



AAAGAAACCA 
CTGCCGARAT 
TACGCCAGAG 
GACCGTCTTT 
GTTTCTCTGT 
TTGTACCGTG 
CAAGGTCGCT 
CTAGAGTTGC 
GAAGGTCAAG 
AGCTGGTTCC 
GTTATCAACC 
ACCACCACCA 
AGCTGATGAT 
ATGCCACTAC 
GCTGTCGATC 
TGGTCAAGAA 
CTTACAAATC 
TCTGAAGCTG 
CTTGTCTCAA 
GATTAGTCAG 
GGTAAATACG 



ATTCACGCTG 
TTTGGAAACC 
GTGGTAARAT 
ATCCAAGAAT 
CTTTACCGGT 
AAATGAAAGA 
TTGGTCTTCG 
TTTGACTGGT 
ATGTCTTGTT 
GAAGTGTCTG 
AACCTTAGCC 
AGAAAGGTTC 
TTGACCGATC 
TGTCTTGTCT 
CATTGGATTC 
CATTACGATG 
CTTACAAGAT 
ATAAATTGAC 
CCATTCGCTG 
ATTGCAAGAC 
ATAACTTGCC 



AACCACCATC 3 00 

GGTATCAAGG 350 

TGGTTTATTC 4 0 0 

TGATTAACAA 4 50 

GTYGGTGAAA 500 

AACCGGTGTC 550 

GTCAAATGAA 600 

TTGACCATTG 650 

GTTCATTGAT 700 

CTTTGTTAGG 750 

ACTGATATGG 8 00 

CGTCACCTCT 85 0 

CTGCTCCAGC 900 

AGAGGTATTT 950 

CAAATCCAGA 10 0 0 

TCGCTACTGG 1050 

ATCATTGCTA 110 0 

TGTCGAAAGA 1150 

TTGCTGAAGT 12 0 0 

ACTGTCAAAT 1250 

AGAAA 1295 



25 2) INFORMATION FOR SEQ ID NO: 461 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1277 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



40 



45 



50 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3 949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461 



TAACTTGCCA 
ATTTAGTTTT 
ATTGCTATGG 
TACCGGTGCC 
TCATCAATGT 
AAGGAAAAGA 
CACTGCTGCC 
CCCCATACGC 
GGTAAGACCG 
TGGTGGTTTC 
ACGATTTGTA 
GACTCCAAGG 
TAGAGCTAGA 
ATGAAGAAGG 
ACCCAAGCTG 
CGTCGGTTAT 
GTATTACCAC 
GTCCCAGCTG 
TTTGGATGCC 
ACCCAGCTGT 



GCTATTTTGA 
GGAAGTTGCT 
ATGGTACTGA 
CCAATCTCCG 
TGTTGGTGAA 
AACCAATTCA 
GAAATTTTGG 
CAGAGGTGGT 
TCTTTATCCA 
TCCGTCTTTA 
CCGTGAAATG 
TCGCTTTGGT 
GTTGCTTTGA 
TCAAGATGTC 
GTTCCGAAGT 
CAACCAACCT 
CACCAAGAAA 
ATGATTTGAC 
ACTACTG TCT 
CGATCCATTG 



ATGCTTTGAC 
CAACATTTGG 
AGGTTTAGTC 
TTCCAGTCGG 
CCAATTGATG 
TGCAGAACCA 
AAACCGGTAT 
AAGATTGGTT 
AGAATTGATT 
CCGGTGTCGG 
AAAGAAACCG 
CTTTGGACAA 
CTGGTTTGAC 
TTGTTGTTCA 
GTCTGCTTTG 
TAGCTACTGA 
GGTTCCGTCA 
CGATCCTGCT 
TGTCTAGAGG 
GATTCCAAAT 
2 



TTTGAAGAAC 
GTGAAAACAC 
AGAGGTACTG 
TAGAGGTACC 
ACAGAGGTCC 
CCATCCTTCG 
CAAGGTTGTC 
TGTTCGGTGG 
AACAACATTG 
TGAAAGAACC 
GTGTCATCAA 
ATGAACGAAC 
TATTGCTGAA 
TCGATAACAT 
TTAGGTCGTA 
TATGGGTCTT 
CCTCTGTCCA 
CCAGCCACCA 
TATTTCTGAA 
CCAGATTATT 
81 



GGTGACCAAG 50 

CGTCAGAGCT 10 0 

AAGTCAACGA 15 0 

TTAGGTAGAA 200 

AATTGAATGT 250 

TTGAGCAATC 3 00 

GACTTATTGG 350 

TGCTGGTGTC 400 

CTAAAGCCCA 450 

AGAGAAGGTA 50 0 

CTTAGAAGGT 550 

CACCAGGTGC 600 

TACTTCAGAG 650 

TTTCAGATTC 700 

TTCCATCTGC 750 

TTGCAAGAAC 800 

AGCTGTCTAT 850 

CATTCGCCCA 900 

TTGGGTATTT 950 

GGACGCTGCC 100 0 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GTTGTTGGTC AAGAACATTA TGATGTCGCT 
GCAAGCTTAC AAATCCTTAC AAGATATCAT 
AATTGTCTGA AGCTGATAAA TTGACTGTCG 
AGATTCTTGT CTCAACCATT CGCCGTTGCT 
5 AGGTAGATTA GTCAGATTGC AAGACACTGT 
TGGAAGGTAA ATACGATCAC TTGCCAG 



PCT/CA00/01150 

ACTGGTGTTC AACAAACT^T 1050 
TGCTATTTTG GGTATGGATG 1100 
AAAGAGCCCG TAAGATTCAA 1150 
GAAGTTTTCA CTGGTATTCC. 1200 
CAAATCATTC AAGGATGTTT 1250 

1277 



10 2) INFORMATION FOR SEQ ID NO: 462 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1278 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 



GTAACTTGCC AGCTATTTTG AACGCTTTGA CCTTGAAGAA CGGTGAAAAC 50 
GACTTAGTTT TAGAAGTTGC CCAACATTTG GGTGAAAACA CCGTCAGAGC 100 
TATTGCTATG GATGGTACTG AAGGTTTAGT TAGAGGTACT CCAGTTACCG 150 
ATTCTGGTGC TCCAATTTCT GTCCCAGTCG GTAGAGGTAC TTTAGGTAGA 2 00 

30 ATCTTGAACG TTATTGGTGA ACCAATTGAT GAACAAGGTC CAGTTGATGC 250 
CAAGGAAACC AGACCAATTC ACCAAGACCC ACCAGCATTC GTTGATCAAT 300 
CCACCAAGGC TGAAGTTTTG GAAACTGGTA TCAAGGTTGT CGATTTATTA 350 
GCCCCTTACG CTAGAGGTGG TAAGATTGGT TTATTCGGTG GTGC CGGTGT 400 
CGGTAAGACC GTCTTTATCC AAGAATTGAT TAACAACATT GCCAAGGCCC 450 

35 ATGGTGGTTT CTCTGTTTTC ACTGGTGTCG GTGAAAGAAC CAGAGAAGGT 500 
AACGATTTAT ATAGAGAAAT GAAGGAAACT GGTGTCATTA ACTTGGAAGG 550 
TGACTCCAAG GTCGCCTTGG TTTTCGGTCA AATGAACGAA CCACCAGGTG 600 
CTAGAGCTAG AGTTGCTTTA ACCGGTTTAA CCATTGCCGA ATACTTCAGA 650 
GACGAAGAAG GTCAAGATGT GTTATTGTTC GTCGATAACA TTTTTAGATT 700 

4 0 CACCCAAGCC GGTTCCGAAG TGTCTGCTTT GTTAGGTCGT ATTCCATCGG 750 
CTGTCGGTTA TCAACCAACC TTAGCCACTG ATATGGGTCT TTTACAAGAA 8 00 

AGAATTACCA CCACCACCAA GGGTTCCGTT ACTTCTGTCC AAGCTGTCTA 850 
CGTCCCAGCC GATGATTTAA CCGATCCTGC TCCAGCTACC ACTTTCGCCC 900 
ACTTGGATGC TACCACTGTG TTGTCTCGTG GTATCTCTGA ATTGGGTATT 950 

45 TACCCAGCTG TCGATC C ATT GGATTCCAAA TCCAGATTGT TAGATGCTGA 1000 

TATCGTTGGT AAAGAACACT ACGAAGTTGC CACTGGTGTC CAACAAACCT 1050 

TACAAGCTTA CAAATCTTTA CAAGATATCA TTGCTATTTT AGGTATGGAT 1100 

GAATTGTCTG AAGCCGATAA ATTGACTGTC GAAAGAGCCA GAAAGATCCA 1150 

AAGATTCTTG TCTCAACCAT TCGCCGTTGC CGAAGTTTTC ACCGGTATCC 12 00 

50 CAGGTAGATT AGTTAGATTG CAAGACACTG TTAAATCTTT CAAGGAAGTC 1250 

TTAGAAGGTA AATATGATCA CTTACCAG 1278 



55 2) INFORMATION FOR SEQ ID NO: 463 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 
SO (C) STRANDEDNESS: Double 

282 
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WO 01/23604 



PCT/CA00/01150 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463 

10 

TCTGGTCAGA GGCGAGAAGG TCGTCGACAC AGGTGCCCCA ATCTCCGTCC 50 

CTGTCGGCAG AGAGACCCTG GGCAGAATCA TCAACGTTAT CGGTGAACCT 100 

ATCGACGAGC GTGGCCCAAT CAACTCAAAG TTGAGAAAGC CTATCCACGC 150 

CGACCCTCCT TCCTTCGCAG AACAGTCCAC CGCCGCCGAA GTCCTGGAAA 2 00 

15 CAGGTATCAA GGTCGTCGAC TTGCTGGCCC CTTACGCCAG AGGTGGTAAG 250 

ATCGGTCTGT TCGGTGGTGC CGGTGTCGGT AAGACCGTGT TCATCCAAGA 3 00 

ACTGATCAAC AACATCGCAA AGGCTCACGG TGGTTTCTCC GTGTTCACAG 350 

GTGTCGGTGA AAGAACCAGA GAAGGTAACG ATTTGTACAG AGAAATGAAG 4 00 

GAAACCGGTG TCATCAACTT GGAAGGTGAC TCTAAGGTCG CCTTGGTCTT 4 50 

2 0 CGGCCAAATG AACGAACCAC CAGGAGCCAG AGCCAGAGTC GCCTTGACCG 500 

GTTTGACCAT CGCAGAATAC TTCAGAGATG AAGAAGGTCA AGATGTCCTG 550 

CTGTTCGTCG ACAACATTTT CAGATTCACC CAAGCCGGTT CAGAAGTCTC 600 

CGCTTTGCTA GGTCGTATCC CATCCGCCGT CGGTTATCAA CCAACCTTGG 650 

CCACCGATAT GGGTCTGTTG CAAGAAAGAA TTACCACCAC AAAGAAGGGT 700 

25 TCCGTCACTT CCGTCCAAGC CGTCTACGTG CCTGCAGATG ATTTAACAGA 750 

TCCTGCCCCT GCCACTACTT TCGCGCACTT GGACGCCACC ACCGTCTTGT 800 

CCAGAAGTAT CTCAGAATTG GGTATCTACC CAGCTGTCGA CCCATTGGAC 850 

TCCAAGTCTA GATTGCTAGA CGCTGCCGTT GTCGGTGAAG AGCATTACAA 900 

CGTCGCCACA AAGGTCCAAG AAACTTTACA AACTTACAAG TCTCTGCAAG 950 

3 0 ATATCATCGC CATTTTGGGT ATGGATGAAT TGTCCGAACA AGATAAGCTA 1000 

ACTGTCGAAA GAGCAAGAAA GATCCAAAGA TTCTTGTCCC AACCATTCGC 1050 

TGTCGCTGAA GTTTTCACCG GTATCGAAGG TAAGCTGGTC AGATTGAAGG 1100 

ACACCATCTC CTCTTTCAAG GCTGTCTTGG AAGGTAAGTA CGATGATCTT 1150 

CCAG H54 

35 



2 ) INFORMATION FOR SEQ ID NO: 464 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
50 (B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464 

CCACTACGAG GACGGTAACC TTCCTGCTAT TTTCAACGCC TTGACTCTTA 50 

55 AGAACGGTGA CCAAAACTTG GTTTTGGAAG TTGCCCAGCA TTTGGGTGAA 100 

AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TGGTTAGAGG 150 

TGCCAGCGTC ACTGACACTG GTGCTCCTAT CTCTGTGCCT GTTGGTCGTG 200 

GTACTTTGGG TCGTATCATC AACGTTATTG GTGAGCCAAT TGACGAGCGT 250 

GGACCAATCG AGTCCAAGCA AAAGAAGCCC ATTCACGCTG AACCACCATC 300 

60 GTTCGTCGAA CAATCCACTT CTGCCGAGGT TTTGGAAACC GGTATCAAGG 350 
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TTGTCGACTT 

GGTGGTGCCG 

CATTGCCAAG 

GAACCAGAGA 

5 ATCAACTTGG 

CGAACCTCCA 

CTGAATACTT 

AACATTTTCA 

TCGTATTCCT 

10 GTTTGTTGCA 

GTCCAAGCTG 

TACTACTTTT 

CCGAGTTGGG 

TTGTTGGATG 

15 CGTTCAACAA 
TTTTGGGTAT 

GCCCGTAAGA 
TTT CACTGGT 
CTTTCAAGGA 

20 



GTTGGCTCCA 
GTGTCGGTAA 
GCTCACGGTG 
AGGTAACGAT 
AAGGTGAATG 
GGAGCTAGAG 
CAGAGATGAG 
GATTCACTCA 
TCGGCTGTCG 
AGAGCGTATT 
TCTATGTGCC 
GCTCACTTGG 
TATTTACCCA 
CCTCTGTTGT 
ACCTTGCAAG 
GGACGAATTG 
TCGAGAGATT 
ATCAGTGGTA 
AGTCTTGGAA 



TACGCCAGAG 
GACTGTGTTT 
GTTTCTCCGT 
TTGTACCGTG 
CAAGGTGGCC 
CCAGAGTTGC 
GAGGGTCAAG 
AGCTGGTTCT 
GTTACCAACC 
ACCACCACCA 
AGCCGATGAT 
ATGCTACCAC 
GCTGTCGATC 
CGGTGAGGAG 
CTTACAAGTC 
TCGGAAGCTG 
CTTGTCTCAA 
AGTTGGTCAG 
GGTAAGTACG 



GTGGTAAGAT 
ATCCAAGAGT 
GTTCACCGGT 
AAATGAAGGA 
TTGGTGTTCG 
CCTTACCGGT 
ATGTGTTGTT 
GAAGTGTCGG 
TACTTTGGCC 
AGAAGGGTTC 
TTGACCGATC 
TGTGTTGTCT 
CTTTGGATTC 
CACTACTCGG 
TTTGCAAGAT 
ACAAGTTGAC 
CCATTTGCTG 
ATTGGAGGAC 
ATCACTTGCC 



PCT/CAOO/01150 

TGGATTGTT'Cr "400 

TGATTAACAA 450 

GTCGGTGAAA 500 

AACTGGTGTC 550 

GTCAAATGAA 600 

TTGACCATCG 650 

GTTCGTCGAC 700 

CTTTGTTGGG 750 

ACCGATATGG 800 

CGTCACCTCT 850 

CTGCTCCTGC 900 

AGAGGTATCT 950 

CAAGTCGAGA 1000 

TTGCTTCTAA 1050 

ATCATTGCCA 1100 

CGTCGAGAGA 1150 

TTGCCGAAGT 1200 

ACTATCAGAT 1250 

AGA 1293 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 465 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 



CACTTTGACG 

40 GAACGGTGAC 
ACACCGTCAG 
GAGGCTGTCA 
GACTTTGGGT 
GACCAATCAA 

45 TTCGTTGAGC 
TGTCGACTTG 
GTGGTGCCGG 
ATTGCCAAGG 
AACCAGAGAA 

50 TCAACTTCGA 
GAGCCACCAG 
CGAATACTTC 
ACATTTTCAG 
CGTATTCCAT 

55 TACCTTGCAA 
TCCAGGCCGT 
ACCACTTTCG 
CGAGTTGGGT 
TGTTGGATGC 

60 GTCCAGCAGA 



ATGGTAACTT . 

CAGGACTTGG 

AACCATTGCC 

CTGACACTGG 

CGTATCATCA 

GTCCAAGAAG 

AGTCTACTTC 

TTGGCCCCTT 

TGTCGGTAAG 

CCCACGGTGG 

GGTAACGATT 

GGGTGACTCC 

GAGCTAGAGC 

AGAGATGAAG 

ATTCACTCAG 

CTGCTGTCGG 

GAAAGAATTA 

TTACGTGCCA 

CTCACTTGGA 

ATCTACCCAG 

CTCTGTTGTC 

CCTTGCAGGC 



GCCAGCCATT 
TCTTGGAGGT 
ATGGACGGTA 
TGCTCCAATC 
ACGTTATTGG 
AGAAACCCAA 
TGCTGAGGTT 
ACGCCAGAGG 
ACCGTCTTTA 
TTTCTCTGTC 
TGTACCGTGA 
AAGGTCGCCT 
TAGAGTTGCT 
AAGGTCAGGA 
GCTGGTTCCG 
TTACCAGCCA 
CCACCACCAA 
GCTGATGATT 
TGCTACCACT 
CTGTCGACCC 
GGTAAGGAGC 
CTACAAGTCC 
2 



TTCAACGCCT 
CGCCCAGCAC 
CCGATGGTTT 
TCCGTGCCTG 
TGAGCCAATT 
TCCACACTGA 
TTGGAGACTG 
TGGTAAGATT 
TCCAAGAGTT 
TTTACCGGTG 
AATGCAGGAG 
TGGTCTTCGG 
TTGACCGGTT 
TGTGTTGTTG 
AGGTGTCCGC 
ACCTTGGCCA 
GAAGGGTTCC 
TGACCGATCC 
GTGTTGTCTA 
ATTGGACTCC 
ACTACGATGT 
TTGCAGGATA 
84 



TGAAGTTGAA 50 

TTGGGTGAGA 100 

GGTCAGAGGC 150 

TTGGTCGTGA 2 00 

GACGAGAGAG 250 

CCCACCAACC 300 

GTATTAAGGT 3 50 

GGTTTGTTCG 400 

GATTAACAAC 450 

TCGGTGAGAG 500 

ACCGGTGTCA 550 

TCAGATGAAC 600 

TGACCATTGC 650 

TTCGTTGACA 700 

CTTGTTGGGT 750 

CCGATATGGG 800 

GTCACCTCTG 850 

TGCCCCAGCT 900 

GAGGTATTTC 950 

AAGTCTAGAT 1000 

TGCTACCAAC 1050 

TCATTGCCAT 1100 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

TTTGGGTATG GATGAATTGT CCGAAGCCGA CAAGTTGACT GTCGAGAGAG 11BCT 

CTAGAAAGAT TCAGAGATTC TTGTCCCAGC CATTCGCCGT CGCCGAGGTT 1200 

TTCACTGGTA TCGAGGGTAA GTTGGTCAGA TTGGAGGAGA CCGTCAAGTC 1250 

TTTCAAGGAG GTCTTGGATG GTAAGTACGA CCACTTGCCA GAG 1293 



2) INFORMATION FOR SEQ ID NO: 466 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 
20 (B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466 

AACGGTGGAT CTAAGTTAGT CTTAGAAGTT GCTCAACATT TGGGTGAAAA 50 

25 CACTGTCAGA ACCATTGCTA TGGATGGTAC TGAAGGTTTA GTTAGAGGTC 100 

AACCAGTTAA TGATACTGGT GCTCCAATCT CTGTCCCAGT CGGTAGAGGT 150 

ACTTTAGGTA GAATCTTAAA CGTTATTGGT GATCCAGTCG ATGAAAGAGG 2 00 

TCCAATTGAT TGTAAGGAAA GAAAACCAAT TCATCAAGAT CCTCCTGCTT 250 

TCGTTGAACA ATCTACTGAA GCTGAAGTTT TAGAAACCGG TATTAAGGTT 3 00 

30 GTTGATTTAT TAGCTCCTTA CGCTAGAGGT GGTAAGATTG GTTTATTCGG 3 50 

TGGTGCTGGT GTCGGTAAAA CCGTTTTCAT TCAAGAATTA ATTAACAATG 400 

TTGCAAAGGC TCATGGTGGT TTCTCAGTTT TCACTGGTGT CGGTGAAAGA 4 50 

ACCAGAGAAG GTAATGATTT ATACAGAGAA ATGAAGGAAA CTGGTGTTAT 500 

TAACTTAGAA GGTGAATCTA AGGTCGCCTT AGTTTTCGGT CAAATGAATG 550 

35 AACCACCAGG AGCAAGAGCA AGAGTTGCTT TAACCGGTTT AACTATTGCT 600 

GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCATTGATAA 650 

CATTTTCAGA TTTACTCAAG CAGGTTCTGA AGTTTCTGCA TTGTTAGGTA 7 00 

GAATTCCATC CGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 750 

CTTTTACAAG AAAGAATTAC AACTACTAAG AAAGGTTCCG TTACTTCTGT 800 

4 0 CCAAGCAGTT TATGTCCCAG CAGATGATTT AACTGATCCT GCTCCAGCAA 850 

CTACTTTCGC CCACTTAGAT GCAACTACTG TCTTATCTAG AGGTATTTCC 900 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCTAGATT 950 

ATTAGATGTT GCTGTTGTTG GTCAAGAACA TTATGATGTT GCAACTCAAG 1000 

TTCAAGAAAC TTTACAAGCA TACAAATCTT TACAAGATAT TATTGCTATT 1050 

45 TTAGGTATGG ATGAATTATC TGAACAAGAT AAATTAACCG TTGAAAGAGC 1100 

AAGAAAGATT C 1111 



50 2) INFORMATION FOR SEQ ID NO: 467 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 



285 



SUBSTITUTE SHEET (RULE 26) 
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(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467 

5 

TCGAACAAGG TCAATTGCCA GAAATTTTGA ACGCTTTGGA GATTGAWACT 50 

CCTCAAGGTA AGTTGGTTTT GGAAGTTGCC CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTGGTC CGTGGTGAGA 150 

AGGTTTTGGA CACTGGTGCT CCAATTTCCG TCCCAGTCGG TAGAGAAACT 2 00 

10 TTGGGTAGAA TCATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGCCC 2 50 

AATCAAGTCC AAGATGAGAA AGCCAATTCA CGCTGACCCT CCATCCTTTG 3 00 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACCGGTAT CAAGGTTGTC 350 

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAGTTGATT AACAACATTG 450 

15 CCAAGGCCCA TGGTGGTTTC TCCGTCTTCA CCGGTGTCGG TGAAAGAACC 50 0 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACCG GTGTCATCAA 550 

CTTGGAAGGT GACTCCAAGG TCGCCTTGGT CTTCGGTCAA ATGAACGAAC 600 

CACCTGGAGC TAGAGCCAGA GTTGCCTTGA CCGGTTTGAC TATCGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTG TTGTTGTTTA TCGACAACAT 700 

2 0 TTTCAGATTC AC AC AAGC CG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAC CAACCTACTT TGGCCACCGA TATGGGTTTG 8 00 

TTGCAAGAAA GAATTACCAC TACCAAGAAG GGTTCCGTTA CCTCCGTCCA 850 

AGCTGTCTAC GTCCCTGCTG ATGATTTGAC TGATCCTGCT CCAGCTACYA 900 

CTTTCGCCCA TTTGGACGCC ACCACCGTGT TGTCCAGAGG TATCTCCGAA 950 

25 TTGGGTATCT ACCCAGCTGT CGATCCATTG GATTCCAAGT CTAGATTGTT 10 00 

GGACGCTGCC GTTGTCGGTC AAGAACATTA CGACGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 110 0 

GGTATGGATG AATTGTCTGA ACAAGACAAG TTGACTGTCG AAAGAGCCAG 1150 

AAAGATCCAA AGATTCTTGT CTCAACCATT CGCCGTCGCC GAAGTTTTCA 12 00 

30 CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CGCTTCCTTC 1250 

AAGGCTGTTT TGGAAGGTAA GTACGATCAC TTG 1283 



35 2) INFORMATION FOR SEQ ID NO: 468 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1287 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, krusei 

(B) STRAIN: ATCC 34135 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468 



CTTCGAACAA GGCCAATTAC CACAAATTTT AAACGCTTTA GTTATGGATA 50 

ATGGTGGTAA CAAGTTAGTT TTAGAAGTTG CTCAACATTT AGGTGAAAAC 100 

ACTGTCAGAA CCATTGCTAT GGATGGTACT GAAGGTTTAG TTAGAGGTCA 150 

AACCGTTAAC GATACCGGTG CTCCAATCTC TGTCCCAGTT GGTAGAGGTA 200 

55 CCTTAGGTAG AATCTTGAAC GTCATTGGTG ATCCAGTCGA TGAAAGAGGT 250 

CCAGTTGACT GTAAGGAAAG AAAGCCAATT CACGCTGATC CTCCAGCTTT 3 00 

CGTTGAACAA TCCACTGAAG CTGAAGTTTT GGAAACTGGT ATTAAGGTTG 3 50 

TCGATTTATT AGCACCTTAC GCAAGAGGTG GTAAGATTGG TTTATTCGGT 400 

GGTGCTGGTG TTGGTAAGAC CGTTTTTATC CAAGAATTGA TCAACAATGT 450 

60 YGCAAAGGCT CATGGTGGTT TCTCCGTTTT CACTGGTGTT GGTGAAAGAA 500 

286 
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CCAGAGAAGG TAACGATTTA TACAGAGAAA TGAAGGAAAC TGGTGTTATT 550 

AACTTGGAAG GTGAATCCAA GGTCGCCTTA GTTTTCGGTC AAATGAACGA 60 0 

ACCACCAGGA GCTAGAGCAA GAGTTGCTTT AACTGGTTTG ACCATTGCAG 650 

AATATTTCAG AGATGAAGAA GGTCAAGATG TCTTGTTATT CATTGATAAC 700 

5 ATTTTCAGAT TCACCCAAGC AGGTTCTGAA GTCTCTGCAT TATTAGGTAG 750 

AATTCCATCT GCTGTCGGTT ATCAACCAAC TTTAGCAACC GATATGGGTC 8 00 

TTTTACAAGA AAGAATTACC ACCACCAAGA AGGGTTCCGT TACTTCTGTC 850 

CAAGCTGTTT ATGTCCCAGC CGATGATTTA ACCGATCCTG CTCCAGCTAC 900 

TACTTTCGCC CACTTGGATG CAACCACTGT CTTGTCYAGA GGTATTTCCG 950 

10 AATTAGGTAT CTACCCAGCT GTCGATCCAT TAGATTCTAA GTCTAGATTA 10 00 

TTAGATGTTG CAGTTGTTGG TCAAGAACAT TATGAAGTTG CAACTCAAGT 1050 

CCAAGAAACT TTACAAGCTT ACAAGTCTTT ACAAGATATT ATTGCTATTT 1100 

TGGGTATGGA TGAATTATCT GAACAAGATA AGTTAACYGT TGAAAGAGCA 1150 

AGAAAGATCC AAAGATTCTT ATCTCAACCA TTCTCTGTTG CAGAAGTTTT 12 00 

15 CACTGGTATT CCAGGTAAGT TAGTCAGATT AGAAGAAACC ATCAAGTCTT 1250 

TCAGGGATGT TCTTGCAGGT AAGTACGATC ACTTACC 12 87 



20 2) INFORMATION FOR SEQ ID NO: 469 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lambica 

(B) STRAIN: ATCC 24 750 



35 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469 



TCGAACAAGG TCAATTACCA CCAATTCTTA ACGCTTTGGT CATGGAAAAC 50 

GATGGTCAAA AGTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTAGTT AGAGGTCAAC 150 

CTGTTAACGA CACTGGTGCT CCAATCTCTG TCCCAGTTGG TAGAGGTACT 2 00 

40 CTTGGTAGAA TCTTGAACGT CACTGGTGAC CCAGTYGATG AAAGAGGTCC 250 

TGTCGAMTGT AAGGAGAGAA GACCAATTCA CCAAGACCCA CCTGCTTTCG 3 00 

TTGACCAATC CACTGAAGCT GAAGTTTTGG AAACCGGTAT TAAGGTTGTC 3 50 

GATTTATTAG CACCTTACGC TAGAGGTGGT AAGATTGGTT TGTTCGGTGG 4 00 

TGCTGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAATGTTG 4 50 

45 CTAAGGCCCA CGGTGGTTTC TCCGTTTTCA CTGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTATA CAGAGAAATG AAGGAAACCG GTGTTATTAA 550 

CCTTGAAGGT GAATCTAAGG TCGCCCTCGT TTTCGGTCAA ATGAACGAAC 600 

CACCAGGAGC AAGAGCTAGA GTTGCTCTTA CTGGTTTGAC CATTGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGCTTTTCA TTGACAACAT 7 00 

50 TTTCAGATTC ACCCAAGCAG GTTCCGAAGT GTCTGCTTTG TTAGGTAGAA 7 50 

TTCCWTCTGC CGTTGGTTAC CAACCAACCT TAGCAACCGA TATGGGTTTG 8 00 

TTACAAGAAA GAATTACCAC CACCAAGAAG GGTTCCGTCA CCTCCGTCCA 8 50 

AGCTGTTTAC GTCCCAGCTG ATGATTTRAC TGACCCTGCA CCAGCAACCA 9 00 

CTTTCGCCCA CTTGGATGCT ACCACCGTCT TGTCTAGAGG TATTTCCGAA 950 

55 TTAGGTATTT ACCCAGCTGT CGATCCATTA GATTCTAAGT CTAGATTATT 1000 

GGATGTCGCT GTTGTCGGTC AAGAACATTA TACCGTTGCA ACY CAAGTCC 1050 

AAGAAACTTT ACAAGCTTAC AAGTCCTTAC AAGATATCAT TGCTATTTTG 1100 

GGTATGGACG AATTATCTGA ACAAGATAAG CTTACTGTTG AAAGAGCAAG 1150 

AAAGATCCAA AGATTCCTTT CCCAACCTTT CTCCGTCGCA GAAGTTTTCA 1200 

60 CTGGTATCCC AGGTAAGCTT GTCAGATTAG AAGAAACCAT TATTTCTTTC 1250 
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AGAGATGTCC TCGATGGTAA GTACGACCAC TTA 1283 

5 2 ) INFORMATION FOR SEQ ID NO: 470 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 1140 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM : Candida lusitaniae 

(B) STRAIN: ATCC 6603 5 



20 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470 



AGGTGCCTCT GTCACTGACA CTGGTTCTCC AATCTCTGTC CCTGTTGGTC 50 

GTGAAACCTT GGGTAGAATT ATCAACGTTG TTGGTGAGCC AATTGACGAG 100 

AGAGGCCCAA TCAACTCCAA GAAGAGAAAC CCAATTCACA CTGAGCCACC 150 

ATCGTTTGTT GAACAATCCA CTTCTGCTGA AGTTTTGGAG ACTGGTATCA 2 00 

25 AGGTTGTCGA CTTGTTGGCC CCTTACGCCA GAGGTGGTAA GATTGGTTTG 250 

TTCGGTGGTG CCGGTGTCGG TAAGACCGTT TTCATCCAAG AGTTGATTAA 3 00 

CAACATTGCC AAGGCCCACG GTGGTTTCTC TGTTTTCACT GGTGTCGGTG 350 

AAAGAACCAG AGAAGGTAAC GATTTGTACC GTGAAATGCA AGAGACCGGT 400 

GTCATCAACT TCGAGGGTGA CTCCAAGGTC GCCTTGGTCT TCGGTCAAAT 450 

30 GAACGAACCA CCAGGAGCCA GAGCTAGAGT TGCTTTGACC GGTTTGACTA 500 

TTGCCGAGTA CTTCAGAGAC GAAGAGGGCC AAGATGTCTT GTTGTTCGTT 550 

GACAACATTT TCAGATTCAC CCAGGCCGGT TCTGAAGTGT CTGCTTTGTT 600 

GGGTCGTATT CCATCCGCTG TCGGTTACCA ACCAACCTTG GCCACCGATA 650 

TGGGTGCTTT GCAAGAGAGA ATTACCACCA CCAAGAAGGG TTCCGTCACC 700 

35 TCTGTCCAAG CCGTTTATGT TCCAGCTGAT GACTTGACTG ACCCTGCTCC 750 

AGCCACCACC TTCGCCCACT TGGACGCCAC CACTGTGTTG TCCAGAGGTA 800 

TCTCTGAATT GGGTATCTAC CCAGCTGTCG ACCCATTGGA CTCCAAGTCT 850 

AGATTGTTGG ACGCTTCTAT TGTTGGTAAG GAGCACTACG AAGTTGCTTC 900 

TAACGTTCAA CAAACTTTGC AAGCTTACAA GTCTTTGCAA GATATCATTG 950 

40 CCATTTTGGG TATGGATGAA TTGTCTGAGG CTGACAAGTT GACCGTTGAG 1000 

AGAGCCAGAA AGATCCAAAG ATTCTTGTCT CAACCATTCG CTGTTGCCGA 1050 

GGTTTTCACT GGTATCCCAG GTAGATTGGT CAGATTGGAG GACACTGTCA 1100 

GATCCTTCAA GGAAGTTTTG GACGGTAAGT ACGACCACTT 1140 



2) INFORMATION FOR SEQ ID NO-: 471 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 12 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida noirvegensis 

(B) STRAIN: ATCC 22977 
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(xi) SEQUENCE DESCRIPTION: SEQ : 

TCAATTCGAA CAAGGTCAAT TACCACCAAT 
ACAACGGTGG AACTAAGTTA GTTTTAGAAG 
AACACCGTCA GAACCATTGC TATGGATGGT 
TCAACCAGTT AATGATACTG GTGCTCCAAT 
GTACTTTAGG TAGAATCTTA AACGTTATTG 
GGTCCAATCG ATTGTAAGGA AAGAAAACCA 
TTTCGTTGAA CAAT CAACTG AAGCTGAAGT 
TTGTCGATTT ATTAGCTCCT TACGCTAGAG 
GGTGGTGCAG GTGTTGGTAA AACCGTTTTT 
TGTTGCAAAA GCTCATGGTG GTTTCTCCGT 
GAACTAGAGA AGGTAACGAT TTATACAGAG 
ATTAATTTAG AAGGTGATTC TAAGGTCGCA 
CGAACCACCT GGAGCAAGAG CAAGAGTTGC 
CTGAATATTT CAGAGATGAA GAAGGTCAAG 
AACATTTTCA GATTTACTCA AGCAGGTTCT 
TAGAATTCCA TCCGCTGTCG GTTATCAACC 
GTCTTTTACA AGAAAGAATT ACTACTACTA 
GTTCAAGCAG TTTATGTCCC AGCAGATGAT 
AACTACTTTC GCCCACTTAG ATGCAACTAC 
CCGAATTAGG TATTTACCCA GCTGTCGATC 
TTATTAGATG TTGCAGTTGT TGGTCAAGAA 
AGTTCAAGAA ACTTTACAAG CTTACAAATC 
TTTTAGGTAT GGATGAATTA TCTGAACAAG 
GCTAGAAAAA TTCAAAGATT CTTATCTCAA 
TTTCACTGGT AT C C C AGGT A AGCTTGTTAG 
CATTCAGAGA TGTCTTAGCA GGTAAGTACG 



ID NO: 471 

TTTAAACGCT TTAGTCATGG 50 
TTGCTCAACA TTTAGGTGAA 100 
ACTGAAGGTT TAGTTAGAGG 150 
CTCTGTCCCA GTCGGTAGAG 200 
GTGATCCAGT CGATGAAAGA 2 50 

ATTCATCAAG ATCCTCCTGC 3 00 

TTTAGAAACT GGTATTAAAG 350 
GTGGTAAGAT TGGTTTATTC 4 00 

ATTCAAGAAT TAATTAACAA 450 
TTTCACTGGT GTCGGTGAAA 500 
AAATGAAGGA AACTGGTGTT 550 
TTAGTTTTCG GTCAAATGAA 600 
TTTAACTGGT TTAACTATTG 65 0 

ATGTCTTGTT ATTCATTGAT 700 
GAAGTTTCCG CTTTGTTAGG 750 
AACTTTAGCA ACTGATATGG 8 00 

AGAAAGGTTC TGTTACTTCC 850 
TTAACTGATC CTGCTCCAGC 900 
TGTCTTATCT AGAGGTATTT 950 

CATTAGATTC TAAATCTAGA 1000 

CATTATGATG TTGCAACTCA 1050 

TTTACAAGAT ATTATTGCTA 1100 

ATAAATTAAC CGTTGAAAGA 1150 

CCTTTCTCTG TCGCAGAAGT 1200 

ATTAGAAGAA ACTATTTCTT 125 0 

ATCACTTACC AGAAAA 12 96 



2) INFORMATION FOR SEQ ID NO: 472 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 90018 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472 

ACACTTCGAA GAAGGAAACT TGCCAGCTAT TTTGAACGCT TTGACCTTGA 50 

AAAACGGTAA ACAAGACTTG GTCTTGGAAG TTGCCCAACA TTTGGGTGAA 100 

AACACCGTCA GAGCAATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TACTTCTGTC ACTGACACTG GTGCCCCAAT TTCTGTCCCA GTTGGTAGAG 200 

GTACTTTGGG TAGAATCATC AATGTTACTG GTGACCCAAT TGATGAAAGA 250 

GGTCCAATTG AATGTAAGAA GAGAAACCCA ATCCACGCTG AACCACCTTC 300 

ATTCATTGAA CAATCCACTG CTGCTGAAGT TTTGGAAACC GGTATTAAGG 350 

TTGTCGACTT GTTGGCTCCA TACGCTAGAG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGTAA AACCGTGTTT ATCCAAGAAT TGATCAACAA 450 

CATTGCCAAG GCACATGGTG GTTTCTCGGT TTTCACTGGT GTTGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTT 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTTTTCG GTCAAATGAA 600 

CGAACCACCT GGAGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 
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CTGAATACTT TAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC * 700 

AACATTTTCA GATTCACCCA AGCTGGTTCA GAAGTGTCTG CTTTGTTGGG 75 0 

TCGTATTCCA TCCGCTGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 8 00 

GTCTTTTGCA AGAACGTATT ACCACTACCA AAAAGGGTTC AGTTACTTCA 850 

5 GTCCAAGCCG TTTACGTGCC AGCTGATGAT TTAACCGATC CTGCTCCAGC 900 

TACCACTTTC GCTCACTTGG ATGCCACCAC TGTGTTGTCT AGAGGTATTT 950 

CGGAGTTGGG TATTTACCCA GCTGTCGATC GATTGGATTC CAAATCCAGA 10 00 

TTGTTGGATG CTGCCGTTGT TGGTCAAGAA CATTACGACG TTGCCACTGG 1050 

TGTTCAACAA ACTTTACAAG CTTACAAGTC CTTGCAAGAT ATCATTGCTA 1100 

10 TCTTGGGTAT GGATGAATTG TCCGAACAAG ATAAATTGAC TGTCGAAAGA 1150 

GCTCGTAAGA TTCAAAGATT CTTGTCTCAA CCATTTGCTG TCGCTGAAGT 12 00 

TTTCACTGGT ATTCCAGGTA AATTGGTTAG ATTGTCTGAA ACTGTCAAGT 12 50 

CATTCAAGGA AGTCTTGGAA GGTAAGTACG ATAACTTGCC AGAAAAT 12 97 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 473 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 12 85 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, rugosa 
<B) STRAIN: ATCC 96275 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 73 



CCAGTTTGGC AACGACCTCC CTGCCATTTT GAACGCCCTC ACTCTTCAGC 50 
GTGAAGACGG TAACAAGCTT GTTCTTGAGG TTGCCCAGCA TCTCGGTGAG 100 

35 AACACCGTCC GTACCATTGC TATGGACGGT ACCGAGGGTT TGGTGCGTGG 150 
CACTGGTGTC CACGACACCG GACACCCCAT CATGACTCCC GTCGGTGACG 2 00 

GTACCCTGGG ACGTATTCTT AACGTCACCG GTGACCCTGT AGACGAGCGT 2 50 

GGTCCCGTCA AGACTGACAA GTTCCGCCCC ATCCACGCCG AGGCCCCTGC 3 00 

CTTCGATGAG CAGGCTACCA GTGCCGAGGT TCTTGAGACC GGTATCAAGG 350 

40 TTGTCGACTT GCTCGCTCCT TACGCCAAGG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGCAA GACCGTCTTC ATCCAGGAGC TGATTAACAA 450 
CATCGCCAAG GCCCACGGTG GTTACTCCGT GTTCACTGGT GTCGGTGAGC 50 0 

GTACTCGTGA GGGTAACGAT TTGTACAAGG AAATGATCGA GTCCGGTGTC 550 
ATCAACCTCG ATGGTGAGTC CAAGGTCGCC TTGGTGTTCG GTCAGATGAA 600 

45 CGAGCCCCCT GGAGCCCGTG CCCGTGTCGC CCTTACCGGT CTTACCATCG 650 
CTGAGTATTT CCGTGATGAG GAGGGTAAGG ATGTCTTGTT GTTCGTTGAC 700 
AACATTTTCC GCTTCACTCA GGCCGGTTCT GAGGTGTCCG CCTTGCTTGG 750 
TCGTATTCCT TCCGCTGTCG GTTACCAGCC TACCCTGGCC ACCGATATGG 800 
GTGCCCTTCA GGAGCGTATC ACCACCACCA AGAAGGGTTC CGTTACATCC 850 

50 GTCCAGGCCG TCTACGTCCC TGCCGATGAT TTGACTGATC CCGCCCCTGC 900 
CACCACCTTC GCCCATTTGG ATGCCACCAC TGTCTTGTCT CGTGCCATCT 950 

CTGAGTTGGG TATCTACCCC GCTGTCGACC CTCTCGACTC CAAGTCCCGT 100 0 

CTTCTTGACG CCGCTGTCGT TGGTCAGGAG CACTACGATA CTGCCACCTC 105 0 

CGTTCAGCAG ACTTTGCAGG CTTACAAGTC TTTGCAGGAT ATCATTGCCA 1100 

55 TTCTTGGTAT GGATGAGTTG TCCGAGTCTG ACAAGCTCAC CGTCGAGCGT 1150 

GCTCGTAAGA TCCAGCGTTT CCTCTCCCAG CCTTTCGCTG TTGCTGAGGT 120 0 

CTTCACTGGT ATTCAGGGCC GTCTTGTTCC TCTCAAGGAC ACTGTCCGCT 1250 

CCTTCAAGGA GATTCTCGAA GGTAAGTACG ATGCT 12 85 
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2) INFORMATION FOR SEQ ID NO: 474 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 



15 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474 



TTGAACAAGG TCAATTGCCA GCTATTTTGA ACGCTTTGGA AATCGACACT 50 

CCAGAAGGAA AGTTGGTTTT GGAAGTCGCT CAACATTTGG GTGAAAACAC 100 

20 TGTCAGAACC ATTGCTATGG ATGGTACTGA AGGTTTAGTC CGTGGTGAAA 15 0 

ACGTTTCTGA CACTGGTGCT CCAATTTCCG TCCCAGTTGG TAGAGAAACC 200 

TTGGGTAGAA TTATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGTCC 250 

AATCAACTCC AAGATGAGAA AGCCAATTCA TGCTGATCCT CCATTATTCG 300 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACTGGTAT CAAGGTTGTC 350 

25 GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 4 00 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAACATTG 450 

CCAAGGCTCA TGGTGGTTTC TCTGTCTTCA CTGGTGTCGG TGAAAGAACC 50 0 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACTG GTGTTATCAA 550 

CTTGGAAGGT GATTCTAAGG TCGCGTTGGT TTTCGGTCAA ATGAACGAAC 600 

3 0 CTCCTGGAGC TAGAGCTAGA GTCGCCTTGA CTGGTTTGAC CATCGCTGAA 65 0 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGTTGTTTA TCGACAACAT 700 

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAT CAACCAACTT TGGCCACCGA TATGGGTTTG 800 

TTGCAAGAAA GAATTACTAC CACCAAGAAG GGTTCCGTCA CTTCTGTCCA 850 

35 AGCCGTTTAC GTGCCAGCTG ATGATTTGAC TGATCCTGCT CCAGCTACCA 90 0 

CTTTCGCGCA TTTGGATGCC ACCACTGTGT TGTCCAGAGG TATCTCTGAA 950 

TTGGGTATCT ACCCAGCTGT CGATCCTTTG GATTCCAAAT CTAGATTGTT 1000 

GGATGCTGCC GTCGTTGGTC AAGAACATTA CGATGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

4 0 GGTATGGATG AATTGTCCGA ACAAGATAAG TTGACCGTCG AAAGAGCTAG 1150 

AAAGATTCAA AGATTCTTGT CTCAACCTTT CGCTGTCGCT GAAGTCTTCA 1200 

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CTCTTCTTTC 1250 

AAGGCTGTCT TGGACGGTAA GTACGATCAC TTG 12 83 



2) INFORMATION FOR SEQ ID NO: 475 



(i) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 12 90 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropical is 

(B) STRAIN: ATCC 13803 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475 



CGACGAAGGT 
GTGACCAAGA 
5 GTCAGGGCTA 
TGTCACTGAT 
TGGGTAGAAT 
ATTGAATGTA 
TGAACAATCT 

10 ATTTGTTGGC 
GCTGGTGTCG 
CAAAGCTCAT 
GAGAAGGTAA 
TTGGAAGGTG 

15 ACCTGGTGCT 
ACTTCAGAGA 
TTCAGATTCA 
TCCATCTGCT 
TGCAAGAACG 

2 0 GCTGTTTATG 
CTTCGCTCAC 
TGGGTATTTA 
GATGCTACTG 
ACAAACTTTA 

25 GTATGGATGA 
AAGATTCAAA 
TGGTATCCCA 
AAGATGTCTT 



AACTTGCCAG 
CTTGGTTTTG 
TTGCTATGGA 
ACCGGTGCTC 
TATCAACGTT 
AGGAAAGAAA 
ACTGCTGCCG 
TCCTTACGCT 
GTAAAACCGT 
GGTGGTTTCT 
CGATTTGTAC 
ACTC CAAGGT 
AGAGCTAGAG 
TGAAGAAGGT 
CCCAAGCTGG 
GTCGGTTATC 
TATTACCACC 
TCCCAGCCGA 
TTGGATGCCA 
CCCAGCTGTC 
TTGTTGGTCA 
CAAGCTTACA 
ATTGTCCGAA 
GATTCTTGTC 
GGTAGATTGG 
GGAAGGTAAA 



CTATTTTGAA 
GAAGTTGCTC 
TGGTACCGAA 
CAATTTCTGT 
GTTGGTGAAC 
GCCAATTCAC 
AAATTTTGGA 
AGAGGTGGTA 
CTTCATCCAA 
CTGTCTTTAC 
CGTGAAATGA 
CGCCTTGGTC 
TTGCTTTGAC 
CAAGATGTCT 
TTCCGAAGTG 
AACCAACCTT 
ACCAAGAAAG 
TGATTTAACC 
CTACTGTCTT 
GATC CATTGG 
AGAACATTAT 
AGTCCTTGCA 
GCCGATAAAT 
CCAACCATTC 
TTAGATTACA 
TACGATCACT 



TGCTTTGACT 
AACATTTGGG 
GGTTTAGTCA 
CCCAGTTGGT 
CAATTGATGA 
GCTGAACCAC 
AACCGGTATT 
AGATTGGTTT 
GAATTGATTA 
CGGTGTCGGT 
AAGAAACTGG 
TTCGGTCAAA 
TGGTTTGACC 
TGTTGTTCAT 
TCTGCTTTGT 
GGCCACTGAT 
GTTCTGTTAC 
GATCCAGCTC 
GTCTAGAGGT 
ATTCTAAATC 
GATGTTGCCA 
AGATATCATT 
TGACTGTCGA 
GCTGTTGCCG 
AGATACTGTT 
TGCCAGAAAA 



PCT/CA00/01 


150 


TTGAAGAACG 


r - /"v 

50 


TG AAAAC AC C 


100 


GAGGTACTGC 


150 


AGAGGTACCT 


200 


C AGAGGTC CA 


250 


CTTCATTCGT 


300 


AAGGTTGTCG 


350 


GTTCGGTGGT 


400 


ACAACATTGC 


450 


TV TV TV ^1 TV TV — _ TV 

GAAAGAACCA 


500 


TGTCATCAAC 


550 


fn/TTk *TV TV TV 

TGAACGAACC 


600 


ATTGCTGAAT 


650 


TGACAACATT 


700 


TGGGTCGTAT 


750 


ATGGGTCTTT 


800 


TTCTGTCCAA 


850 


TV r~ ^ III TV /^T TV 

CAGCTACCAC 


900 


ATTTCTGAAT 


950 


CAGATTATTG 


1000 


CTGGTGTTCA 


1050 


GCTATTTTGG 


1100 


AAGAGCTCGT 


1150 


AAGTTTTCAC 


1200 


AAATCTTTCA 


1250 




1290 



30 



35 



40 
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50 
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2 ) INFORMATION FOR SEQ ID NO: 476 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1267 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 6 



TTTGAACGCC 
AGGTTGCCCA 
GGTACCGAAG 
AATTACTGTC 
TTGGTGAGCC 
CCAATTCACG 
GGTTTTGGAG 
GAGGTGGTAA 
TTCATCCAGG 
TGTTTTCACC 
GTGAGATGAA 
GCTCTCGTCT 
CGCTTTGACC 



CTTGAGATCA 
GCACTTGGGT 
GTCTCGTCCG 
CCAGTTGGTC 
AATTGATGAG 
CTGAGCCACC 
ACTGGTATCA 
GATTGGTCTC 
AGTTGATTAA 
GGTGTCGGTG 
GGAGACTGGT 
TCGGTCAGAT 
GGTTTGACCA 



AGAACCACGA GTCTGGAAGA 
GAGAACACTG TCAGAACTAT 
TGGTGAGTCT GTTGTTGACA 
GTGAGACCTT GGGTCGTATC 
CGTGGTCCAA TCAACACCAA 
ATCCTTCGCT GAGCAGTCTA 
AGGTTGTCGA CCTTCTCGCC 
TTTGGTGGTG CCGGTGTCGG 
CAACATTGCC AAGGCCCACG 
AGAGAACCAG AGAGGGTAAC 
GTCATCAATC TCGAGGGTGA 
GAACGAGCCA CCAGGTGCCC 
TTGCTGAGTA CTTCAGAGAT 
292 



TTGGTCCTCG 50 

TGCCATGGAT 100 

CCGGTTCTCC 150 

ATCAACGTTA 200 

GCACAGAAAC 250 

CTGCTGCTGA 3 00 

CCATACGCCA 350 

TAAGACCGTT 400 

GTGGTTTCTC 450 

GATTTGTACC 500 

GTCCAAGGTT 550 

GTGCCCGTGT 600 

GAGGAGGGTC 650 
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AGGATGTCTT GTTGTTTATT GACAACATTT 
TCCGAGGTGT CTGCCTTGCT TGGTCGTATT 
GCCAACTTTG GCCACCGATA TGGGTTTGTT 
CCCAGAAGGG TTCCGTCACT TCTGTCCAGG 
GATTTGACTG ATCCTGCTCC AGCCACCACT 
CACTGTGTTG TCCCGTGGTA TCTCTGAGTT 
ACCCATTGGA CTCCAAGTCC AGATTGTTGG 
GAGCACTACA ACACCGCCAC CGATGTCCAG 
GTCTCTCCAG GATATCATTG CTATTTTGGG 
CTGACAAGTT GACTGTCGAG AGAGCCAGAA 
CAGCCATTCG CTGTCGCTGA GGTTTTCACC 
TAGACTTCAG GACACCATCA AGTCCTTCAG 
ACGACCACTT GCCAGAG 



TCAGATTCAC CCAGGCCGGT" 700 

CCATCCGCTG TCGGTTACCA 750 

GCAGGAGAGA ATTACCACCA 80 0 

CCGTTTACGT CCCAGCTGAT 850 

TTCGCCCACT TGGACGCCAC 900 

GGGTATTTAC CCAGCTGTCG 950 

ACGCTGCCGT TGTTGGTGAC 10 00 

CAGACCCTTC AGGCTTACAA 1050 

TATGGATGAG TTGTCTGAGG 1100 

AGATTCAGCG TTTCCTTTCC 1150 

GGTATCCCAG GTAGATTGGT 1200 

AGAGGTTTTG GACGGTAAGT 1250 

1267 



2) INFORMATION FOR SEQ ID NO: 477 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 12 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, viswanathii 

(B) STRAIN: ATCC 28269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477 

CCAATTCGAC GAAGGTAGCT TGCCAGCTAT CTTGAACGCC TTGACCTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCCCAACA CTTGGGTGAA 100 

AACACTGTCA GAGCTATTGC TATGGATGGT ACCGAAGGTT TGGTCAGAGG 150 

TACTGCTGTC ACTGACACCG GTGCTCCAAT CTCCGTCCCA GTCGGTAGAG 2 00 

GTACCTTGGG TAGAATCATC AACGTTGTTG GTGAACCAAT TGACGACAGA 250 

GGTCCAATTG AATGTAAGGA AAGAAAGCCA ATTCACGCTG AACCACCTTC 3 00 

TTTCGTTGAA CAATCCACTG CTGCCGAAAT TTTGGAAACC GGTATCAAGG 350 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGTAA GACTGTCTTT ATCCAAGAGT TGATTAACAA 450 

CATTGCCAAG GCCCATGGTG GTTTCTCTGT CTTCACTGGT GTCGGTGAAA 50 0 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCC TTGGTTTTCG GTCAAATGAA 600 

CGAACCACCT GGTG CTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CCGAATACTT CAGAGACGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700 

AACATTTTCA GATTCACCCA AGCCGGTTCC GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCTGCCGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCCGATGAT TTGACCGATC CTGCTCCAGC 90 0 

CACCACCTTC GCTCACTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

CCGAATTGGG TATCTACCCA GCTGTCGATC CATTGGACTC CAAGTCCAGA 1000 

TTGTTGGATG CTGCTGTTGT TGGTCAAGAA CATTACGATG TTGCCACTGG 1050 

TGTCCAACAA ACCTTGCAAG CTTACAAGTC TTTGCAAGAT ATCATTGCCA 1100 

TTTTGGGTAT GGATGAATTG TCTGAATCCG ATAAGTTGAC TGTCGAAAGA 1150 

GCTCGTAAGA TCCAAAGATT CTTGTCGCAA CCATTTGCTG TTGCCGAAGT 12 0 0 

TTTCACTGGT ATCCCAGGTA GATTAGTTAG ATTGCTGGAA ACCGTCCAAT 1250 

CTTTCAAAGA CGTCTTGGCT GGTAAATACG ATCACTTGCC AGAAAA 1296 
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2 ) INFORMATION FOR SEQ ID NO: 478 

( i ) SEQUENCE CHARACTERI STICS : 
5 (A) LENGTH: 12 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, zeylanoides 

(B) STRAIN: ATCC 7351 



15 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478 



TCAATTCGAG CAAGGCAACC TCCCTGCCAT CTTGAACGCT CTCAC CTTGA 50 

AGAATGGTGA CAACGACTTG GTTTTGGAAG TTGCCCAGCA CTTGGGTGAG 100 

20 AACACCGTCA GAGCCATTGC CATGGATGGT ACCGAGGGTT TGGTTAGAGG 150 

TGCGTCCGTC AAGGACACTG GCGCCCCTAT CTCGGTCCCC GTTGGCCGCG 200 

GGACTTTGGG TCGTATCATC AACGTCACCG GTGACCCCAT TGACGAGAGA 250 

GGTCCCATCG AGCTGACCCA GAGAAACCCT ATCCACGCCG ACCCCCCCTC 300 

GTTCGTTGAG CAGTCCACCA ACGCTGAGGT TTTGGAGACT GGTATCAAGG 350 

25 TTGTCGATTT GTTGGCTCCC TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGTAA GACCGTCTTC ATTCAGGAGT TGATCAACAA 450 

CATCGCCAAG GCCCACGGTG GGTTCTCGGT CTTCACTGGT GTCGGTGAGA 500 

GAACTAGAGA GGGTAACGAC TTGTACCGTG AGATGAAGGA GACCGGTGTC 550 

ATCAACTTGG AGGGTGACTC CAAGGTGGCC TTGGTGTTCG GTCAGATGAA 600 

30 CGAGCCCCCT GGAGCCAGAG CCAGAGTCGC CTTGACCGGG TTGACCATTG 650 

CCGAATACTT TAGAGACGAG GAGGGTCAGG ATGTGTTGTT GTTCGTCGAC 700 

AACATCTTCA GATTCACCCA AGCTGGTTCG GAGGTGTCGG CCTTGTTGGG 750 

TCGTATTCCC TCTGCCGTCG GTTACCAGCC CAC CTTGGCA ACTGATATGG 800 

GATTGTTGCA GGAGCGTATC ACCACGACCA AGAAGGGTTC CGTCACCTCA 850 

35 GTGCAGGCCG TCTACGTCCC CGCTGATGAC TTGACTGACC CTGCTCCCGC 900 

CACCACCTTT GCCCACTTGG ACGCCACCAC CGTGTTGTCC AGAGGTATCT 950 

CTGAGTTGGG TATCTACCCC GCCGTCGACC CCTTGGACTC CAAATCGAGA 1000 

TTGTTGGACG CTGCCGTGGT CGGTCAGGAG CACTACGATG TTGCCTCGAA 1050 

CGTCCAGCAG ACCTTGCAGG CCTACAAGTC CTTGCAGGAT ATCATTGCCA 1100 

40 TTTTGGGTAT GGATGAGTTG TCCGAGGCTG ACAAGTTGAC CGTTGAGAGA 1150 

GCCAGAAAGA TCCAGAGATT CTTGTCGCAG CCCTTCGCTG TTGCCGAGGT 1200 

TTTCACTGGT ATCAAGGGTA GATTGGTCAG ATTGGAGGAC ACCGTCAGAT 1250 

CTTTCAAGGA GGTTTTGGAG GGTAAGTACG ACCACTTGCC CGAGA 1295 



2) INFORMATION FOR SEQ ID NO: 479 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 479 



10 



15 



TTCAGGAATT 
CTGATATGCT 
TTCACTGGTG 
AATGCAAGAG 
TTGTCTTCGG 
CTTACCGGTT 
CGGTAGGCTT 
TCGCTTTTCT 
GCTGGTTCTG 
TTACCAACCT 
CCACCACCAC 



GATTGTAAGT 
CTAGAACAAC 
TCGGTGAGCG 
ACCCGTGTCA 
TCAAATGAAC 
TGACCATTGC 
CATGCTTCTA 
AGTGCTTCTC 
AAGTGTCTGC 
ACTCTCGCCG 
CAAGGGATCC 



TCTGTTATCA 
ATTGCTAAGG 
TACCCGTGAG 
TTCAACTCGA 
GAGCCCCCTG 
TGAATACTTC 
TCGCTAGGGG 
TTTATTGACA 
CTTGCTCGGT 
TCGACATGGG 
ATTACTTCAG 



ACTAAAGCCG 
CTCACGGTGG 
GGTAACGATT 
CGGAGAGTCC 
GTGCCCGTGC 
CGTGACGAGG 
CGTGTGATAC 
ACATTTTCCG 
CGTATTCCTT 
TGTTATGCAG 
TGCA 
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ACAGCGGTTG 50 

TTACTCCGTG 100 

TGTACCATGA 150 

AAGGT CGCTC 200 

CCGTGTTGCC 250 

AAGGCCAAGA 300 

AGGAGGCTAA 350 

TTTCACTCAA 400 

CCGCTGTCGG 450 

GAACGTATCA 500 

534 



20 



25 



30 



35 



40 



2 ) INFORMATION FOR SEQ ID NO: 480 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480 



CGTCTTGATT 
ACTCGGTCTT 
TACCACGAAG 
CTGATGTTGT 
GAGGGTGACT 
TGGAGCCCGA 
TCCGAGACGA 
CGATTCACCC 
CTCCGCCGTC 
AGGAGCGAAT 



CAAGAATTGA 
CACCGGTGTC 
TGAGTTGCGC 
CCCTTCTTCT 
CCAAGGTCGC 
GCCCGAGTCG 
GGAGGGTCAG 
AAGCCGGTTC 
GGTTACCAGC 
TACCACCACC 



TCAACAACAT 
GGTGAGCGAA 
CGTCCGAGTG 
CAGATGAGGG 
CTTGGTCTTC 
CCTTGACCGG 
GATGTCTTGT 
CGAAGTGTCC 
CCACTCTGTC 
AAGAAGGGTT 



CGCCAAGGCC 
CTCGAGAGGG 
TTTCCCGGGG 
AAACCGGTGT 
GGTCAGATGA 
TTTGACCATT 
TGTTCATTGA 
GCCTTGTTGG 
CACCGACATG 
CCATCACTTC 



CACGGTGGTT 50 

TAACGATCTG 100 

AATCGCAAGA 150 

CATCAACCTC 2 00 

ACGAGCCCCC 250 

GCCGAATACT 300 

CAACATTTTC 3 50 

GTCGTATCCC 400 

GGTACCATGC 450 

CGTC 4 94 



45 



50 



55 



60 



2 ) INFORMATION FOR SEQ ID NO : 4 81 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 415 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481 

295 
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TTCAGGAGCT TATCAACAAC 
TTCACTGGTG TCGGTGAGCG 
AATGCAGGAG ACTTCCGTCA 
5 TGGTCTTCGG TCAGATGAAC 
CTTACCGGTC TTACTGTAGC 
CGTGCTGCTC TTCATTGACA 
AGGTGTCTGC CCTTCTCGGT 
ACCCTCGCCG TCGACATGGG 
10 CAAGGGTTCC ATTAC 



ATCGCCAAGG CCCACGGTGG 
AACTCGTGAG GGTAACGATC 
TTCAGCTTGA TGGCGAGTCC 
GAGCCCCCTG GAGCTCGTGC 
TGAATACTTC AGAGATGAGG 
ACATTTTCCG ATTCACTCAG 
CGTATCCCCT CTGCCGTCGG 
TGGTATGCAA GAGCGTATTA 



TTACTCCGTC £t> 

TGTACCACGA 10 0 

AAGGTCGCCC 150 

CCGTGTCGCC 2 00 

AGGGTCAGGA 2 50 

GCCGGTTCCG 3 00 

TTACCAGCCC 3 50 

CCACCACCAC 4 00 

415 



15 



2) INFORMATION FOR SEQ ID NO: 482 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1281 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Geotrlchum spp. 

(B) STRAIN: Lev-4 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482 

30 AGGACAACCT CCCCGCTATT CTTAACGCTC TTGAGCTTAA GAGAGATAAC 50 

GGTGAGAAGC TCGTTCT CGA GGTTGCCCAG CATCTGGGTG AGAACACTGT 100 

CCGTACTATT GCTATGGACG GTACTGAGGG TCTCGTCCGT GGTCAGCCCG 150 

TTGTTGACAC CGGTGCCCCC ATTACCATTC CCGTTGGTCG TGGTACTCTT 200 

GGTAGAATTA TCAACGTCAT TGGTGAGCCC ATCGATGAGC GTGGACCCAT 2 50 

35 TGAGGCTACC AAGTACCTCC CCATCCACAC CGAGCCCCCC ACCTTCGCTG 3 00 

AGCAGT CTAC CTCCGCTGAG GTTCTTGAGA CTGGTATCAA GGTTGTCGAT 3 50 

CTCCTTGCCC CCTACGCCCG TGGTGGTAAG ATTGGTCTCT TCGGTGGTGC 4 00 

CGGTGTCGGT AAGACCGTTT TCATTCAGGA GCTGATTAAC AACATTGCCA 4 50 

AGGCCCATGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCCGT 500 

40 GAGGGTAACG ATTTGTACCG TGAGATGAAG GAGACCGGTG TCATCAACCT 550 

CGAGGGTGAG TCTAAGGTCG CTCTCGTTTT CGGTCAGATG AACGAGCCCC 600 

CTGGAGCCCG TGCCCGTGTT GCTCTTACTG GTCTTACCAT TGCTGAGTAC 650 

TTCCGTGATG AGGAGGGTCA GGATGTGTTG CTCTTCGTTG ACAACATTTT 700 

CCGTTTCACT CAGGCCGGTT CCGAGGTGTC TGCCCTTTTG GGTCG TATTC 750 

45 CCTCCGCTGT CGGTTACCAG CCCACCCTTG CCACTGATAT GGGTGCCCTG 800 

CAAGAGCGTA TTACCACCAC CCAGAAGGGT TCCGTCACTT CCGTCCAGGC 850 

CGTCTACGTC CCTGCCGATG ATTTGACCGA TCCTGCCCCT GCTACCACCT 900 

TCGCCCATTT GGATGCCACC ACCGTCTTGT CGCGTTCCAT TTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA TCCCCTTGAT TCCAAGTCTC GTCTTTTGGA 10 00 

50 TATCACCGTC GTTGGCCAGG AGCACTACGA TGTTGCTACC CAGGTCCAGC 1050 

AGACCCTCCA GTCCTACAAG TCTCTTCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCTGAGGC TGATAAGCTT ACTGTCGAGC GTGCCCGTAA 1150 

GATCCAGAGA TTCCTTTCCC AGCCCTTCAC TGTCGCTGAG GTTTTCACTG 12 00 

GTATCGAGGG CCGTCTCGTT CCTTTGAAGG ACACTGTTCG CTCTTTCAAG 12 50 

55 GAGATCCTTG AGGGCAAGTA CGACCACCTC C 12 81 



2) INFORMATION FOR SEQ ID NO: 483 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: G185A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483 

15 ATTCAAGAAT TGATCGTACG TTCCTCCGCC CCACACACGA TCAATGGAGA 50 

AAGAAACAAA TTTTTTGGGA GTGGTCATTT TTTCTAATAA TTCGAATAGA 100 

ACAACATTGC CAAAGCCCAC GGTGGTTACT CCGTGTTCAC TGGTGTCGGC 150 

GAGCGGACCC GTGAAGGAAA TGACTTGTAC CACGAAATGC AGGAAACCCG 200 

TGTTATCCAG CTCGATGGAG AGTCCAAGGT CGCCCTCGTT TTCGGTCAGA 250 

20 TGAACGAGCC TCCCGGAGCC CGTGCCCGTG TTGCCCTCAC TGGTCTGACC 300 

GTTGCCGAGT ACTTCCGTGA CGAGGAAGGC CAAGACGGTA TGTATAAGTA 350 

TACACCGTAG CAAATCAACA CAGAGCTTCA CTCACGCTCG GATTTAGTGC 400 

TTCTCTTCAT CGACAACATT TTCCGCTTCA CTCAGGCCGG TTCCGAAGTG 450 

TCTGCCCTGC TCGGCCGTAT TCCCTCCGCC GTCGGTTACC AACCCACCCT 500 

25 CGCCGTGGAC ATGGGTGGTA TGCAGGAACG TATCACCACT ACCACCAAGG 550 

GCTCCATCAC CTCTGTGCAR GCCGTCTACG TCCCCG 586 



30 2 ) INFORMATION FOR SEQ ID NO: 4 84 



35 



40 



45 



50 



55 



60 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484 



GCGTGGCCAG 
GTGGTGCCAC 
GAGCGTGGCC 
CCCTGCCTTT 
TCAAAGTCGT 
CTGTTCGGTG 
CAACAACATC 
GTGAGCGTAC 
GGTGTCATTA 
GATGAACGAG 
CTGTGGCCGA 
ATTGACAACA 
GCTGGGTCGT 
ATATGGGTGC 
ACGTCGGTGC 



AAGGTCATTG 
ACTGGGTCGT 
CCGTTAAGAC 
GTCGAGCAGT 
TGACCTGATT 
GTGCCGGTGT 
GCCAAGGCCC 
TCGTGAGGGT 
ACCTCGAGGG 
CCCCCGGGTG 
GTACTTCCGT 
TTTTCCGTTT 
ATCCCTTCGT 
CATGCAGGAG 
AGGCCGTCTA 



ACACTGGTGC TCCCATCACC 
ATTCTGAACG TCACGGGTGA 
TGACGTTTTC CGCCCCATTC 
CGACTGATGC CGAGATTCTC 
GCCCCTTACG CCCGTGGTGG 
CGGTAAGACC GTGCTTATCC 
ACGGTGGTTT CTCCGTGTTC 
AACGATTTGT ACCACGAGAT 
TGACTCGAAG GTGGCTCTGG 
CCCGTGCGCG TGTCGCTCTT 
GACGACGAGG GCCAGGATGT 
CACCCAGGCC GGTTCGGAGA 
CGGTCGGTTA CCAGCCCACT 
CGTATCACCA CCACCAAGAA 
CGTGCCCGCC GATGATGTCA 
297 



ATCCCCGTCG 50 

CCCTATTGAC 100 

ACCGTGACCC 150 

GAGACTGGTA 200 

TAAGATTGGT 250 

AGGAGCTCAT 3 00 

ACTGGTGTCG 350 

GATTGAAACC 400 

TGTTCGGTCA 450 

ACTGGTCTGA 500 

GCTGCTGTTC 550 

CTTCGGCTCT 600 

TTGTCGACCG 650 

GGGTTCGATT 700 

CTGACCCTGC 750 



SUBSTITUTE SHEET (RULE 26) 



10 



25 



55 



WO 01/23604 PCT/CA00/01 150 

CCCTGCCACT ACCTTCGCCC ACCTTGACGC TACCACTGTG CTTGACCGTT 800 

CGATCGCTGA GCTGGGTATC TACCCCGCTG TTGACCCGTT GAACTCGAAC 850 

TCGCGTATGC TTGACCCCGC TATTGTGGGT CAGGAGCACT ACGACGTGGC 900 

CTCTGGCGTG CAGAAGCTGC TCCAGGACTA CAAGTCGCTT CAAGATATCA 950 

TTGCCATTCT GGGTATGGAT GAGCTTTCTG AGGAGGACAA. GCTCACTGTC 1000 

GAGCGTGCCC GTAAGATGCA GCGTTTCATG TCGCAGCCTT TCGCTGTCGC 1050 

CCAGGTCTTT ACTGGTATCG AAGGTCGTCT TGTTGCCCTG AAGGACACGA 1100 

TCAAGGCCTG CAAGGAGATC CTGTCGGGCA AGCACGACAA CCTCC 1145 



2 ) INFORMATION FOR SEQ ID NO: 4 85 



( i ) SEQUENCE CHARACTERI STI CS : 
15 (A) LENGTH: 12 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 



TCCCGCCATC TTCAACGCCC TGGAGGTCCA GGACATGAAG AACGGTGGCC 50 

GCCTTGTTCT GGAGGTTGCC CAGCACCTTG GTGAGAACAC TGTTCGTTGC 100 

30 ATTGCTATGG ACGGTACCGA GGGTCTTGTC' CGTGGTCAGA AGGTCCTTGA 150 

CACTGGTGCC CCGATCACTA TCCCTGTCGG TAACGGTACC TTGGGCCGYA 200 

TCCTGAACGT CACTGGTGAG CCTGTGGATG AGCGTGGTCC GGTTAAGACT 250 

GACGTCTACC GTCCAATCCA CCGTGAGCCC CCGGCGTTCG TTGAGCAGTC 300 

GACTGATGCT GAGATTCTTG AGACTGGTAT CAAGGTGGTC GACCTGCTCG 350 

35 CCCCATACGC TCGTGGTGGT AAGATTGGTC TWTTCGGTGG TGCYGGTGTY 400 

GGTAAGACCG TGCTGATTCA GGAGCTTATC AACAACATTG CCAAGGCCCA 450 

CGGTGGTTTC TCGGTGTTCA CTGGTGTCGG TGAGCGTACT CGTGAGGGTA 500 

ACGATCTGTA CCATGAAATG ATTGAGACTG GTGTCATCAA CGTTGACGGT 550 

GACTCGAAGG TCGCTCTCGT GTTCGGTCAG ATGAACGAGC CCCCGGGTGC 600 

40 CCGTGCCCGT GTCGCCCTGA CYGGTCTGAC CATCGCCGAG TACTTCCGTG 650 

ACGACGAGGG TCAGGATGTG CTGCTCTTCA TTGACAACAT TTTCCGTTTC 700 

ACTCAGGCTG GTTCGGAGAC TTCGGCTCTG CTGGGTCGTA TCCCGTCGGC 750 

TGTCGGTTAC CAGCCKACCC TTGCCACGGA TATGGGTGCC ATGCAGGARC 800 

GTATYACCAC CACCAAGAAG GGTTCGATTA CCTCGGTGCA GGCYGTTTAC 850 

45 GTTCCGGCCG ACGATGTGAC TGACCCTGCC CCGGCCACGA CCTTCGCCCA 900 

CTTGGACGCC ACCACGGTGT TGGACCGTTC GATTGCYGAG CTGGGTATCT 950 

ACCCGGCCGT CGACCCGCTG AACTCGAAGT CGCGTATGCT TGACCCGTCA 1000 

ATTGTCGGTG TGGAGCACTA CAACGTTGCT TCGGGTGTCC AGAAGCTTCT 1050 

YCAGGACTAC AAGTCGCTCC AAGATATCAT TGCCATTCTG GGTATGGATG 1100 

50 AGTTGTCGGA GGAGGACAAG CTCACTGTCG AGCGTGCCCG TAAGATGCAG 1150 

CGTTTCCTGT CGCAGCCTTT CGCTGTGGCC CAGGTCTTCA CTGGTATCGA 1200 

GGGTCGTCTT GTGTCGCTCA AGGACACCAT CAACGCCTGC AAGGAGATTC 1250 

TGTCCGGTAA G 1261 



2) INFORMATION FOR SEQ ID NO: 486 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 1282 bases 

298 
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WO 01/23604 PCT/CA00/01 150 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrima. 

(B) STRAIN: DSM 7 033 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486 



AGGAGGGCAA CTTGCCAGCT ATCTTCAACG CTTTGACGTT GAAGAACGGC 50 

GACCAGAAGT TGGTCTTGGA GGTGGCCCAG CACTTGGGTG AGAACACCGT 100 

15 CAGAACCATT GCCATGGACG GTACCGAGGG TTTGGTCAGA GGCGCCTCTG 150 

TCACCGACAC YGGTGCCCCT ATCTCCGTGC CTGTCGGCCG TGAGACCTTG 200 

GGTCGTATTA TCAATGTTGT TGGTGAGCCA ATCGACGAGA GAGGCCCAAT 250 

CAACACCAAG AAGAGAAACC CTATTCACAC CGACCCACCT TCGTTTGTCC 300 

AGCAATCCAC TTCCGCCGAG GTCTTGGAGA CTGGTATCAA GGTTGTCGAC 350 

2 0 TTGTTGGCCC CTTACGCCAG AGGTGGTAAG ATTGGTTTGT TCGGTGGTGC 4 00 

CGGTGTCGGT AAGACCGTGT TCATCCAGGA GTTGATTAAC AACATTGCCA 450 

AGGCCCACGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCAGA 500 

GAGGGTAACG ATTTGTACCG TGAAATGCAG GAGACTGGTG TCATCAACTT 55 0 

CGAGGGTGAC TCCAAGGTCG CCTTGGTTTT CGGTCAGATG AACGAGCCTC 600 

25 CAGGAGCTAG AGCYAGAGTT GCCTTGACCG GTTTGACCAT CGCCGAGTAC 650 

TTCAGAGACG AGGAGGGTCA GGATGTGTTG TTGTTCGTCG ACAACATTTT 700 

CAGATTCACC CAAGCCGGTT CTGAGGTGTC TGCCTTGTTG GGTCGTATTC 750 

CATCCGCTGT CGGATACCAG CCAACCTTGG CCACCGATAT GGGTGCCTTG 800 

CAGGAGAGAA TTACCACCAC CAAGAAGGGT TCCGTCACCT CCGTCCAGGC 850 

3 0 CGTCTACGTG CCAGCCGATG ACTTGACTGA CCCTGCGCCA GCCACCACTT 900 

TCGCCCACTT GGACGCCACC ACTGTGTTGT CCAGAGGTAT CTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA CCCCTTGGAC TCCAACTCCA GATTGTTGGA 1000 

CGCCACCGTT GTTGGCCAGG AGCACTACGA CGTCGCCACC AACGTCCAGC 1050 

AAACTTTACA AGCTTACAAG TCCTTGCAGG ATATCATTGC CATTTTGGGT 1100 

35 ATGGATGAGT TGTCCGAGAC CCGACAAGTT GACCCGTCGA GAGAGCCAGA 1150 

AAGATCCAGA AGTTCTTGTC CCAGCCATTT GCCGTCGCCG AGGTTTTCAC 12 00 

CGGTATTGAG GGTAGATTGG TTAGATTGGA GGACACCGTT AGATCCTTTA 12 50 

AGGAGGTTTT GGAAGGTAAG TACGACCACT TG 12 82 



2) INFORMATION FOR SEQ ID NO: 487 



( i ) SEQUENCE CHARACTERI STICS : 
4 5 (A) LENGTH: 4 82 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

■ ( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487 



TGTCTTTATC CAGGAGTTGA TTGTACGTCT TTACCTTTCT GCCTGACTGT 50 

. TTACGACAAC TAACGAAAGC GTAGAACAAC ATTGCCAAGG CTCACGGTGG 100 

60 TTACTCTGTC TTCACTGGTG TCGGTGAACG TACTCGTGAG GGTAACGATT 150 

299 
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10 



20 



TGTACCACGA AATGCAGGAA ACTGGTGTCA TTCAGCTCGA GGGTGAATCC 200 

AAGGTCGCCC TCGTGTTCGG TCAGATGAAC GAGCCCCCCG GTGCCCGTGC 250 

CCGTGTCGCT CTTACTGGTT TGACCATTGC CGAGTACTTC CGTGACGAGG 3 00 

AAGGTCAGGA CGTGCTTCTC TTCATTGACA ACATTTTCCG TTTCACTCAG 350 

GCCGGTTCTG AGGTGTCTGC CCTTCTGGGT CGTATCCCCT CTGCCGTCGG 4 00 

TTACCAGCCC ACCCTTGCCG TCGACATGGG TATCATGCAG GAGCGTATTA 450 

CCACCACCAC CAAGGGTTCC ATCACCTCCG TC 4 82 



2) INFORMATION FOR SEQ ID NO: 488 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 90 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488 

TCGAACAAGG TAACTTACCA GCTATCTTGA ATGCTTTAGA AATCAAAACC 50 

CCAAGTGGTA CTCCATTAGT TTTAGAAGTT GCTCAACATT TAGGTGAAAA 100 

CACTGTCAGA ACTATTGCTA TGGATGGTAC TGAAGGTTTA GTTCGTGGTG 150 

3 0 AACAAGTTAC TGATACTGGT TCTCCAATCA CTGTCCCAGT TGGTCGTGAA 2 00 

ACTTTAGGTC GTATTATCAA CGTTGTTGGT GAACCAATTG ATGAACGTGG 2 50 

TCCAATTAAC ACCAAACAAA GAAACCCAAT TCACGCTGAA CCACCTTCAT 3 00 

TCAGTGAACA ATCAACTGCT GCTGAAGTTT TAGAAACTGG TATCAAAGTT 350 

GTYGATTTAT TAGCTCCATA CGCTAGAGGT GGTAAAATTG GTTTATTCGG 4 00 

35 TGGTGCCGGT GTCGGTAAAA CTGTCTTTAT CCAAGAATTG ATTAACAACA 450 

TTGCTAAAGC TCATGGTGGT TTCTCAGTTT TCACCGGTGT TGGTGAAAGA 50 0 

ACCAGAGAAG GTAACGATTT ATACCGTGAA ATGAAAGAAA CTGGTGTTAT 550 

TAACTTGGAA GGTGATTCTA AGGTCGCTTT AGTTTTCGGT CAAATGAATG 600 

AACCACCAGG TGCTAGAGCT CGTGTTGCTT TAACTGGTTT GACCATTGCT 650 

4 0 GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCGTTGATAA 700 

CATTTTCAGA TTCACCCAAG CCGGTTCAGA AGTTTCTGCC TTATTAGGTC 750 

GTATTCCATC TGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 800 

TTGTTACAAG AACGTATTAC CACCACACAA AAAGGTTCAG TTACTTCTGT 850 

CCAAGCTGTT TATGTCCCAG CTGATGATTT AACAGATCCT GCTCCAGCTA 900 

45 CCACTTTCGC CCATTTGGAT GCTACTACTG TCTTGTCTCG TGGTATTTCA 950 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCAAGATT 1000 

ATTAGATGCT TCAGTTGTTG GTCAAGAACA TTATGATGTT GCTACCAACG 1050 

TTCAACAAAC TTTACAAGCT TACAAATCTT TACAAGATAT TATTGCTATT 1100 

TTAGGTATGG ATGAATTGTC TGAACAAGAT AAATTGACTG TCGAAAGAGC 1150 

50 AAGAAAAATC CAAAGATTCT TATCTCAACC ATTTGCTGTT GCCGAAGTTT 1200 

TCACTGGTAT YCCAGGTAGA TTGGTTAGAT TAAAAGACAC TATCAAATCA 1250 

TTCAAAGATG TTTTGGAAGG TAAATATGAT CACTTACCAG 1290 



55 

2 ) INFORMATION FOR SEQ ID NO: 489 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1291 bases 
60 (B) TYPE: Nucleic acid 

300 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489 

CCAATTCGAA CAAGGTAACT TACCAGCTAT CTTGAATGCT TTAGAAATCA 50 

AAACCCCAAG TGGTACTCCA TTAGTTTTAG AAGTTGCTCA ACATTTAGGT 100 

GAAAACACTG TCAGAACTAT TGCTATGGAT GGTACTGAAG GTTTAGTCCG 15 0 

15 TGGTGAACAA GTTACTGATA CTGGTTCTCC AATCACTGTC CCAGTTGGTC 200 

GTGAAACTTT AGGTCGTATT ATCAACGTTG TTGGTGAACC AATTGATGAA 250 

CGTGGTCCAA TTAACACCAA ACAAAGAAAC CCAATTCACG CTGAACCACC 300 

TTCATTCAGT GAACAATCAA CTGCTGCTGA AGTTTTAGAA ACTGGAATCA 350 

AAGTTGTTGA TTTATTAGCT CCATACGCTA GAGGTGGTAA AATTGGTTTA 4 00 

2 0 TTCGGTGGTG CCGGTGTCGG TAAAACTGTC TTTATCCAAG AATTGATTAA 4 50 

CAACATTGCT AAAGCTCATG GTGGTTTCTC AGTTTTCACC GGTGTTGGTG 500 

AAAGAACCAG AGAAGGTAAC GATTTATACC GTGAAATGAA AGAAACTGGT 550 

GTTATTAACT TGGAAGGTGA TTCTAAGGTC GCTTTAGTTT TCGGTCAAAT 600 

GAATGAACCA CCAGGTGCTA GAGCTCGTGT TGCTTTAACT GGTTTGACCA 650 

25 TTGCTGAATA CTTCAGAGAT GAAGAAGGTC AAGATGTCTT GTTATTCGTT 700 

GATAACATTT TCAGATTCAC CCAAGCCGGT TCAGAAGTTT CTGCCTTATT 750 

AGGTCGTATT CCATCTGCTG TCGGTTATCA ACCAACTTTA GCAACTGATA 800 

TGGGTTTGTT ACAAGAACGT ATTACCACCA CACAAAAAGG TTCAGTTACT 850 

TCTGTCCAAG CTGTTTATGT CCCAGCTGAT GATTTAACAG ATCCTGCTCC 900 

30 AGCTACCACT TTCGCCCATT TGGATGCTAC TACTGTCTTG TCTCGTGGTA 950 

TTTCAGAATT AGGTATTTAC CCAGCTGTCG ATCCATTAGA TTCTAAATCA 1000 

AGATTATTAG ATGCTTCAGT TGTTGGTCAA GAACATTATG ATGTTGCTAC 1050 

CAACGTTCAA CAAACTTTAC AAGCTTACAA ATCTTTACAA GATATTATTG 1100 

CTATTTTAGG TATGGATGAA TTGTCTGAAC AAGATAAATT GACTGTCGAA 1150 

35 AGAGCAAGAA AAATCCAAAG ATTCTTATCT CAACCATTTG CTGTTGCCGA 1200 

AGTTTTCACT GGTATCCCAG GTAGATTGGT TAGATTAAAA GACACTATCA 1250 

AATCATTCAA AGATGTTTTG GAAGGTAAAT ATGATCACTT A 1291 



40 



50 



2) INFORMATION FOR SEQ ID NO: 490 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 50 8 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490 

CGTATTGATT CGTGAGTGGC CGTTCCCTTA CAGCAAGCTT ATAAAGGAGC 50 

GAAAAAGATC TGACATTCGG CTTATGTGCT ATACAGAGGA ACTCATCAAC 100 

AACGTCGCCA AGGCTCATGG TGGTTACTCT GTCTTCACCG GTGTCGGAGA 150 

60 GCGAACACGT GAAGGTAACG ATCTCTACCA CGAAATGATT GAAACCGGTG 200 

301 
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TCATTCAGCT CAAGAACGAC 

ATGAACGAGC CCCCCGGAGC 

CATCGCCGAG TACTTCCGTG 

TCGACAACAT TTTCCGATTC 

5 CTCGGACGTA TCCCATCTGC 

TATGGGTGGT ATGCAAGAGC 
CCTCCGTC 



AAGTCCAAGG CCGCTCTGGT 
TCGTGCCCGT- GTCGC T CTGA 
ACGTCGAAGG ACAGGATGTG 
ACCCAGGCCG GTTCAGAGGT 
TGTCGGATAC CAGCCCACAC 
GAATCACAAC CACCAAGAAG 



PCT/CAOO/01150 

CTTCGGACKG * 2SV 

CCGGTCTCAC 3 00 

CTACTCTTCA 350 

ATCTGCCCTG 4 00 

TCTCAACCGA 450 

GGTTCGATTA 500 

508 



10 

2) INFORMATION FOR SEQ ID NO: 491 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 686 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula mucilaginosa 

(B) STRAIN: ATCC 66034 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491 

TGTCCTCATT CAGGAGCTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGCGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

TCTCCCCCTT CAAACTTTTG GCCGGCTAGT TGGCGCAGCG CAAACTGACG 150 

3 0 CGCGCGCCCT GTCCAGACTT GTACCACGAG ATGATCGAGA CTGGTGTCAT 200 

CCAGCTCGAG AACGACAACT CGAAGTGCGC TCTCGTGTTC GGCCAGATGA 250 

ACGAGCCCCC TGGTGCCCGT GCCCGTGTCG CTCTCACTGG GTTCGTCCTT 3 00 

TCTCTCTCTC GAGCGTCCTG GCTTGATACG GAACGCTGAC ACGTCACGCA 350 

GTCTCACTAT TGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGACGTGCTC 4 00 

35 CTCTTCATCG AC AACAT CTT CCGTTTCACC CAGGGTGAGC CGCCTCCGCG 450 

GGCATTCTCC CGTTTCTTTC GCGCTGACGT CTGTCCCGTA TAGCCGGTTC 500 

GGAGGTGTCT GCCCTTCTCG GACGTATCCC GTCCGCTGTC GGATACCAGC 550 

CGACTCTCTC GACCGACATG GGTCAGATGC AGGAGCGTAT CACGTAAGTT 600 

TGGCCGCAGC TCCGTCCGCG GCGCCCTTTG TGTCTGACCG TGTTCCACCG 650 

40 CTCAGCACCA CCAAGAAGGG CTCGATCACC TGTGTC 686 



2) INFORMATION FOR SEQ ID NO: 4 92 

45 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Sporobolomyces salmonicolor 

(B) STRAIN: ATCC 32311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492 

60 TGTCTTGATT CAGGAGCTCA TCAACAACGT CGCCAAGGCG CACGGTGGTT 50 

302 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

ACTCCGTTTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

CACACTTTCG CCGACCTGGC CATTCCGGAA CGTGCGGACT GACGAGGACC 150 

ACCTCGAACA GATCTCTACC ACGAGATGAT CGAGACCGGT GTCATTCAGC 2 00 

TCGACAACGA CAAGTCGAAG ACTGCTCTCG TCTTCGGCCA GATGAACGAG 2 50 

5 CCCCCTGGCG CCCGTGCCCG TGTCGCTCTT ACTGGTCTCA CCATCGCGGA 3 00 

GTACTTCCGT GACGACGAGG GCCAGGACGT GCTTCTCTTC ATCGACAACA 350 

TCTTCCGTTT CACCCAGGGT ACGTTCGATC CCGCCCGTCC AACACGAATG 4 00 

TCGTGGTGAC TGACAACCTG TTGCGCGTGC AGCCGGTTCG GAGGTGTCTG 450 

CCCTTCTCGG TCGTATCCCG TCCGCTGTCG GATACCAGCC CACTCTCTCG 500 

10 ACCGACATGG GTGGCATGCA GGAGCGTATC ACGTACGCCC TCTTCTGCTT 550 

TCTCTCGTTT CGCTCTGCAT CGCTCACGCA TGTTCGCCCG ACAGCACCAC 60 0 

CAAGAAGGGT TCGATCACCT GTGTC 625 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 493 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 1211 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-14 8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493 



TCTCGTCCGT GGTGCTAAGG CCACTGACAC TGGTTCCCCC ATTACCATCC 50 

CCGTCGGCCC CGGTACCCTC GGTCGCATCA TGAACGTCAC CGGTGACCCG 100 

35 ATCGACGAGC GCGGTCCCAT CAAGACCGAC AAGTTCCGTC CCATCCACGC 150 

TGAGGCTCCC GAGTTCGTTG ACCAGTCGAC CACCGCTGAG GTT CTCGTGA 200 

CTGGTATCAA GGTCGTCGAT CTGCTTGCTC CCTACGCCCG TGGTGGTAAG 250 

ATTGGTCTGT TTGGCGGTGC CGGTGTTGGC AAGAC CGTGT TCATCCAGGA 300 

GCTCATCAAC AACATCGCCA AGGCCCACGG TGGTTACTCC GTCTTCACCG 350 

4 0 GTGTCGGCGA GCGTACCCGT GAGGGTAACG ATCTGTACCA CGAAATGCAG 400 

GAGACCTCTG TCATTCAGCT TGACGGTGAC TCCAAGGTCG CCC TGGTGTT 450 

CGGTCAGATG AACGAGCCCC CTGGTGCTCG TGCCCGTGTC GCCTTGACCG 500 

GTTTGACTGT CGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGATGGTATG 550 

TTTTGAATTA TTTCCTTGTC GTACAGTTCC AAATCGAAGA ATTACTAACT 600 

4 5 TGTCAGTGCT TCTCTTCATC GACAACATTT TCCGCTTCAC CCAGGCCGGT 650 

TCTGAGGTGT CTGCCCTTCT GGGTCGTATT CCCTCCGCTG TCGGTTACCA 700 

GCCCACGCTC GCCGTGGACA TGGGTCTGAT GCAGGAGCGT ATTACCACCA 750 

CCCGCAAGGG CTCAATTACC TCCGTCCAGG CCGTCTACGT GCCCGCTGAC 8 00 

GATCTGACGG ATCCCGCCCC CGCCACCACC TTCGCCCATC TGGACGCCAC 850 

50 CACTGTGCTG TCCCGAGGTA TCTCTGAGCT GGGTATCTAC CCCGCTGTCG 900 

ACCCCCTCGA CTCCAAGTCG CGTATGCTGG ACCCCCGTAT TGTCGGTGAC 950 

GACCACTACG AGACCGCCAC TCGCGTCCAG CAGATCCTCC AAGAGTACAA 100 0 

GTCGCTGCAG GACATCATCG CCATTCTGGG TATGGACGAG CTGTCTGAGG 105 0 

CCGACAAGCT TACAGTCGAG CGTGCTCGTA AGATCCAGCG TTTCCTGAGC 1100 

55 CAGCCGTTCA CGGTCGCGCA GGTCTTCACT GGTATCGAAG GCCAGCTGGT 1150 

CGATCTGAAG GACACTATCG CTTCGTTCAA GGCTATCCTG AGCGGTGAGG 12 0 0 

GTGACAGCCT T 12 n 



303 
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2 ) INFORMATION FOR SEQ ID NO: 494 



PCT/CA00/01150 



10 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

. (A) ORGANISM: Stephanoascus ciferrii 
(B) STRAIN: ATCC 52550 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494 

TCTTGTTAGA GGCACCCCAG TCAAGGACAC TGGTGCTCCA ATTACCATTC 50 
CAGTTGGTAA CGGCACTTTG GGCCGTATCG TCAACGTTCT CGGTGAGCCA 100 
ATTGATGAGC GTGGACCAGT CAAGGCTGAC AAGTTCAGAC CTATTCACGC 150 

20 TGAGCCACCA ACCTTCGCTG ACCAGTCCAC CTCTGCCGAG GTTCTTGAGA 2 00 

CCGGTATTAA GGTTGTCGAC TTGCTTGCCC CTTATGCCAG AGGTGGTAAG 250 
ATTGGTCTTT TCGGTGGTGC CGGTGTCGGT AAGACTGTGT TCATTCAGGA 3 00 

GCTTATTAAC AACATTGCTA AGGCCCACGG TGGTTACTCT GTCTTCACTG 3 50 

GTGT CGGTGA GCGAACTCGT GAAGGTAACG ATTTGTACCA CGAAATGATG 4 00 

25 GAGACCGGTG TCATCAACCT TGAGGGTGAC TCCAAGGTGT CTCTTGTGTT 4 50 

CGGTCAGATG AACGAGCCTC CAGGAGCCCG TGCCCGTGTT GCCTTGACCG 500* 
GTTTGACCAT TGCCGAGTAC TTCAGAGATG AGGAGGGCCA GGATGTCTTG 550 
TTGTTCATTG ACAACATTTT CCGATTCACC CAGGCCGGTT CTGAGGTCTC 600 
TGCCTTGTTG GGTCGTATCC CATCTGCCGT CGGTTACCAA CCAACCTTGG 650 

3 0 CTACTGATAT GGGTGGTCTT CAAGAACGTA TTACCACCAC TCAAAAGGGT 7 00 

TCCGTCACCT CTGTCCAGGC TGTCTACGTC CCAGCTGACG ATTTGACTGA 750 
TCCTGCCCCA GCTACCACCT TCGCCCATTT GGACGCCACC ACCGAATTGT 8 00 

CCCGATCTAT CTCTGAGTTG GGTATCTACC CAGCTGTCGA CCCTCTTGGT 850 
TCCAAGTCCC GTCTTTTGGA TGCCTCCGTC GTCGGCCAAG AGCACTACGA 900 

35 CGTTGCCGCC AACGTCCAAC AGACCTTGCA GGCCTACAAG TCTCTCCAGG 950 

ATATCATTGC CATTTTGGGT ATGGACGAAT TGTCTGAGGC TGATAAGCTC 1000 

ACTGTCGAGC GTGCTCGTAA GATGCAGAGA TTCCTTTCTC AGCCATTCAC 1050 

CGTCGCTGAG GTCTTCACTG GTCTCGAGGG TAGACTCGTT TCTTTGAAGG 1100 

ACACCATCCG ATCCTTCAAG GAGATCCTTG ACG 1133 

40 



2) INFORMATION FOR SEQ ID NO: 4 95 



45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophyt.es 
55 (B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495 

GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 
60 GCGAAAATTG GACAATTGAG CAATTTAGCC ATTGGAGAAA AGAAATTTCG 100 

304 
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WO 01/23604 

AGTATTAATT GTTTTTATAG AACAACATTG 
TCTGTCTTCA CTGGTGTCGG AGAGCGTACC 
CCATGAAATG CAGGAGACCC GTGTCATTCA 
TCGCCCTGGT CTTCGGCCAG ATGAACGAGC 
5 GTTGCTCTTA CTGGTTTGAC CATTGCTGAG 
TCAAGACGGT GAGTTTCTTA TGGATAAAAA 
TTTTTTTCAA GAAATTCATG TTCTAACAAA 
TCATCGACAA CATTTTCCGT TTCACTCAGG 
CTGCTTGGTC GTATCCCATC TGCCGTCGGT 
10 CGACATGGGT GGTATGCAGG AACGTATTAC 
TTACCTCC 



PCT/CA00/01150 

CCAAGGCTCA CGGTGGTTAt: H5® 

CGTGAAGGAA ACGATCTCTA 200 

GCTTGATGGC GAGTCCAAGG 250 

CCCCAGGTGC CCGTGCCCGT 300 

TACTTCCGTG ATGAGGAAGG 350 

AAAATTTTTT TTTTTTTTTT 400 

GTGTATTCTA GTGCTTCTCT 450 

CTGGTTCCGA AGTGTCTGCC 500 

TACCAACCCA CTCTTGCCGT 550 

CACCACCAAG AAGGGATCCA 600 

608 



15 2 ) INFORMATION FOR SEQ ID NO: 496 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wang i el la dermatitidis 

(B) STRAIN: WSA-229 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496 



GTTTATTCAA GAACTCATTG TGGGTGGCAT TCTCATAATG TTTCGGCCAC 50 

AATTACTGAT TGAAAATAGA ACAACATTGC AAAGGCTCAT GGTGGTTACT 100 

CCGTGTTCAC TGGTGTCGGC GAGCGAACTC GTGAGGGTAA CGACTTGTAC 150 

CACGAAATGC AGGAGACCTC TGTCATTCAG CTCGATGGCG AGTCCAAGGT 200 

3 5 CGCGCTGGTG TTTGGTCAAA TGAACGAACC TCCTGGTGCT CGTGCTCGTG 250 

TTGCTCTGAC TGGGTAAGTT GTTCCTTCGC TTCTTGCGCG TATCCACATC 3 00 

CCCATCTTGA GAATACGTCT GCCACCATGT CATGTGATGT TGGGCTGGTT 350 

CTGGTTTTTG GGAGGCCCTC AAGTTCAATT TTTGGATGAC AGCACCAGCT 400 

TTACAAGATT ATGCTAACTT AATGGAGTCT TACGGTGGCT GAGTTCTTCA 450 

40 GGGATGAGGA GGGACAGGAT GGTAAGTTTG ATAACAATCT CGTCGGTGTC 500 

AATATCGACG GCGTACTCTT CGCATCAAAA AACCAAAGAG GTGGTTTGGT 550 

GTGAGAAGTG CGCCGGAAAT AATGGCAACC ACGTGACAAT GACCACGTGT 60 0 

GGGGCTCCCG TGCTAACACG TGACAGTCTT GCTCTTCATC GACAACATTT 650 

TCCGATTCAC TCAASCCGGT TCTGARGTGT CTGCCTTGCT TGGTCGTATT 700 

45 CCATCTGCCG TCGGTTACCA ACCCACACTC GCCGTCGACA TGGGTCTCAT 750 

GCAGGAACGT ATCACCACCA CCCGGAAGGG ATCCATCACA TCTG 794 



50 2) INFORMATION FOR SEQ ID NO: 497 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1148 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

305 
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15 



20 



25 



30 



45 



WO 01/23604 PCT/CA00/01150 

(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 38295 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497 



ATCACTATCC 


50 


TGGTGAGCCC 


100 


CCATCCACGC 


150 


GTTCTCGAGA 


200 


AGGTGGTAAG 


250 


TCATCCAGGA 


300 


GTTTTCTGCG 


350 


AGAGATGAAG 


400 


CCCTCGTCTT 


450 


GCCCTTACTG 


500 


GGATGTGTTG 


550 


CCGAGGTGTC 


600 


CCCACTCTGG 


650 


CCAGAAGGGT 


700 


ATTTGACCGA 


750 


ACCGTCCTGT 


800 


TCCCCTTGAT 


850 


AGCACTACGA 


900 


TCTCTCCAGG 


950 


GGACAAGCTG 


1000 


AGCCCTTCAC 


1050 


TCTCTCAAGG 


1100 


CGATGCTC 


1148 



2) INFORMATION FOR SEQ ID NO: 498 



( i ) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498 



GCGCTATTGT CGTTGTTGCT GCCTCCGACG GTCAGATGTA GGTGGAACAT 50 

CTTGGGAAAT ACGTCGTAAA ACACGTCGCT TACGTTTTCG CGAATAGGCC 100 

50 CCAGACTCGT GAGCATTTGC TGCTCGCCCG CCAGGTTGGT GTCCAGAAGA 150 

TCGTTGTCTT CGTCAACAAA ATCGATGCTA TTGATGATCC GGAGATGCTG 200 

GAACTGGTCG AACTCGAGAT GCGTGAGCTG CTGAACAGCT ACGGTTTCGA 250 

GGGTGAAGAG ACTCCGATCA TTTTCGGTTC CGCTCTCTGT GCTCTCGAAG 300 

GACGCCGTGA CGACATCGGT AAAGACAGAA TTGAGCAGCT TATGAACGCT 350 

55 GTCGACACCT GGATCCCCAC TCCTCAGCGT GACCTCGACA AACCTTTCTT 400 

GATGTCTGTC GAGGAAGTGT TCTCTATCGC CGGCCGTGGT ACCGTGGCTT 450 

CTGGTCGTGT CGAGCGTGGT ATCTTGAAGA AGGACTCTGA GGTTGAGATT 500 

GTTGGAGGCT CCTTCGAACC CAAGAAGACC AAAGTCACCG ACATTGAAAC 550 

CTTCAAGAAG AGCTGTGATG AATCGCGTGC TGGTGACAAC TCTGGTCTCC 600 

60 TCCTGCGTGG TATCCGACGT GAAGACGTCA AGCGTGGTAT GGTCATTGCT 650 

306 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



GTTCCCGGCA GCACCAAGGC 
CCTGACCGAG GCGGAGGGTG 
GTCCCCAAGT CTTCATCCGT 
AGATCTTCCG AGAGATTAGC 
5 TGCTGACCTC AGCTTCCCTG 
GTGACAACGT CGAGATGATC 
GCTGGTCAAC GCTTCA 



TCACGACAAG TTCCTCGTCT 
GTCGTCGTAC TGGCTTCGGT 
ACTGCAGGTA AGTTCCCGCA 
GATATATGCT aatgattcat 
ACGGCGACCA ATCTCGCAGA 
CTGAAGACCC ACCACCCTGT 



CCATGTACGT 700 

GCCAACTACC 750 

CACCGTGTCC 800 

CAGACGAGGC 850 

GTTATGCCTG 900 

TGCTGCTGAG 950 

966 



10 



20 



45 



60 



2) INFORMATION FOR SEQ ID NO: 499 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 84 6 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499 

TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCCCAAACCC 50 

GTGAGCACTT GCTTCTTGCT CGTCAAGTTG GTGTTAAGCA CATTGTTGTT 100 

TTCGTTAACA AGATTGATAC TATTGATGAT CCTGAAATGT TGGAACTTGT 150 

3 0 CGAAATGGAA ATGAGAGAAC TTCTTTCTTC TTACGGTTTT GATGGCGATA 2 00 

ACACCCCTGT CATTATGGGT TCTGCTCTCT GTGCTCTTGA AGGTCGTGAA 2 50 

CCAGAAATTG GTGAACAAAG AATCAACCAA CTCCTTGATG CTATCGATGA 3 00 

ATACATTCCT ACCCCAGTTC GTGATATGGA CCAACCTTTC TTGATGCCAC 350 

TTGAAGGTGT TTTCTCTATT CCAGGTCGTG GTACTGTTGC CACTGGACGT 400 

35 GTCTATCGTG GTACTTTGAA GAGAGGTGAA GAAGTTGAAG TTGTTGGCTA 450 

CAATGATGCT CCAATCAAGA CCACCGTTAC TGGTATTGAA ATGTTCAAGA 500 

AGGAACTTGA TCAAGCTCAA GCTGGTGACA ACGCTGGTAT TCTTTTGAGA 550 

GGTGTTAAGC GTGAAGACCT TAAGCGTGGT ATGGTTGTTG CTAAACCAGG 600 

TACCGTTAAG CCACACACCA AGTTCCTTGC CTCCATCTAT GTTTTGACTA 650 

40 AGGAAGAAGG TGGCAGACAC TCTGGCTTTG GTCTTAACTA CAGACCTCAA 700 

CTTTTCCTTG GTTCTGCTGA TGTTACCACT GTCTTGACCT TCCCAGAGGG 750 

TGTTGACCAA AGCACTCAAG TCATGCCAGG TGACAACACT GAAATGGTTT 800 

GCGAACTTGT TCACCCAGTT GCTGTCGAAC AAGGCCAACG TTTCAA 846 



2) INFORMATION FOR SEQ ID NO: 500 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



307 
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(xi) SEQUENCE DESCRIPTION: 



SEQ ID NO: 500 



10 



15 



20 



GGTGCCATTA 
TGAGCATCTT 
TTGTTAACAA 
GAGATGGAGA 
CTCTCCTGTC 
CCGAGATTGG 
TGGATCCCTA 
TGAGGCCTCT 
TCGAGCGTGG 
AACAAGCAGC 
GGAACTTGAT 
GTATTCGTCG 
ACTGTTAAGG 
GGAAGAGGGC 
TGTTCGTTTC 
GTTGAACAGC 
CGAGCTCGTT 



TTGTTGTTGC 
TTGCTTGCCC 
GATTGATACC 
TGCGTGAACT 
ATTATGGGCT 
TAAGGAC CGT 
CCCCCGAGCG 
TTCTCCATTT 
TATTCTCAAG 
CCCTGAAATC 
CAGGCCCAGG 
TGAGGACTTG 
CTCACACCAA 
GGCCGTCACT 
TGCAGCTGAT 
ACACTCAGGT 
CACCCTACTG 



TGCCTCTGAT 
GCCAAGTCGG 
ATTGATGACC 
GTTGAATGAA 
CTGCTCTTGC 
ATCATGCAGC 
TGACCTTGAC 
CTGGTCGTGG 
AAGGGTGAGG 
TGTTGTTACT 
GCGGTGATAA 
CAGCGTGGTA 
GTTCCTTTCC 
CTCCTTTCGG 
GTCACCGTTA 
CTTCCCTGGT 
CTATTGAGGT 



GGACAGATGC 
TATGCAAAAG 
CTGAAATGCT 
TATGACTTCG 
TGCTCTTGAG 
TCTTGGACGC 
AAGCCTTTCA 
TACTGTTGCC 
AAGTCGAGAT 
GGTATTGAAA 
TGCTGGTATC 
TGGTTTTGGC 
TCCATCTACG 
TATGAACTAT 
CTCTTACTTT 
GAGAACACCG 
TGGTCAACGC 



PCT/CA00/01150 



CCCAGACCCG 50 

GTCGTTGTGT 100 

TGATCTTGTC 150 

ATGGAGATAA 2 00 

GACAAGAACC 250 

TGTTGATGAA 3 0 0 

TGATGC C TAT 350 

ACTGGCCGTG 4 00 

CGTTGGTTTC 450 

TGTTCAAGAA 500 

TTGCTTCGTG 550 

CAAGCCTGGA 60 0 

TTCTCTCCAA 650 

CGTCCCCAGA 700 

CCCTGAGGGT 750 

AGATGGTTGG 80 0 

TTCAAC 84 6 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 501 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 944 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidloides immitis 

(B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501 



AGTTGTCGTC 
CTCCCGGATC 
GAGAGCATTT 
TTCGTGAACA 
TGAATTGGAG 
AAACTCCCAT 
CCCGAGATCG 
CTGGATTCCC 
TTGAGGAAGT 
GTGGAGCGTG 
TTCGCCCGAG 
AGTCTTGCGA 
GGCGTTAAGC 
AAGTGTCAAG 
AAGCTGAGGG 
ATGTTCATTC 
GGTACTGATT 
GAAGATCAAG 
CACCACCTTG 



GTTGCTGCTT 
TTGGTTTAAA 
ACTTCTCGCC 
AGGTTGATGC 
ATGCGTGAAC 
CATTTTTGGC 
GTGTTACCAA 
ACTCCTCAGC 
GTTCTCTATT 
GTATCCTCAA 
CCAATCAAAA 
CGAGTCTCGC 
GTGAAGATAT 
GCCCATACTG 
TGGGCGCAAA 
GCACTGCCGG 
GAATTCTATA 
ACAAGATGGC 
CACCCAGTTG 



CAGACGGTCA 
TGGCACTAAT 
CGTCAGATCG 
CATCGAGGAC 
TCCTAACCAG 
TCTGCTCTCT 
GATTGATGAG 
GTGAGACTGA 
TCCGGACGAG 
GAAGGACTCC 
CCAAGGTTAC 
GCTGGTGATA 
TAGCCGTGGC 
AATTCTTAGT 
TCTGGATTCA 
TATGTAATAC 
GACGAAGCGG 
TATGCCAGGA 
CCGCCGAGGC 



AATGTATGCA 
ATAAGACAGG 
GTATCCAAAA 
AAAGAGATGT 
CTACGGTTTC 
GTGCCCTCGA 
CTCTTGCAGG 
CAAGCCCTTC 
GAACCGTTGT 
GAAGTTGAAA 
CGATATCGAA 
ACTCCGGCTT 
ATGGTCGTCG 
TTCGCTTTAC 
GCAGCAAGTA 
TGTGATAATT 
CTCAGCTCAG 
GACAATATCG 
TGGCCAGCGA 



ACCGAGAGCA 5 0 

CCTCAAACTC 100 

AATCGTCGTC 150 

TGGAGCTTGT 2 00 

GAGGGTGAAG 25 0 

AGGAAGACAA 300 

CCGTCGACAC 3 50 

TTGATGTCCA 4 00 

CTCCGGCCGT 4 50 

TTGTCGGCGG 50 0 

ACCTTTAAGA 550 

GCTCCTACGA 600 

CTGTACCAGG 650 

GTCCTCACCG 70 0 

CCGCCCACAG 750 

TCGTTGACAT 8 00 

CTGGCCCGGA 850 

AAATGATTTG 90 0 

TTCA 944 



60 
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2) INFORMATION FOR SEQ ID NO: 502 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9* bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxyspomm 

(B) STRAIN: WSA-212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502 

GCTATCATCG TCGTTGCTGC CTCCGATGGA CAGATGCCCC AGACCCGTGA 50 

ACACTTGCTC CTCGCTCGTC AGGTCGGTGT CCAGCGAATT GTCGTCTTCG 100 

TCAACAAGGT CGATGCCATT GATGACC CCG AGATGCTTGA GCTCGTCGAG 150 

ATGGAGATGC GCGAGCTTCT TAACACCTAC GGCTTCGAAG GCGACGACAC 2 00 

TCCCGTCATC ATGGGCTCTG CTCTTATGTC TCTCCAGAAC CAGCGCCCCG 2 50 

AGATTGGCAC CGAGAAGATC GATGAGCTCC TTGCTGCCGT CGACGAGTGG 3 00 

ATCCCAACCC CCGAGCGTGA CCTTGACAAG CCCTTCCTTA TGTCCGTCGA 3 50 

GGATGTCTTC TCCATTGCCG GCCGTGGTAC CGTCGTGTCT GGCCGTGTGG 4 00 

AGCGTGGTAT TCTGAAGCGT GATCAGGAGA TCGAGCTTGT CGGAAAGGGG 4 50 

CAGGAGGTTA TCAAGACCAA GGTTACCGAC ATTGAGACCT TCAAGAAGTC 500 

TTGTGAGCAG TCCCAGGCTG GTGACAACTC TGGTCTCCTC ATCCGAGGTG 550 

TTCGCCGTGA GGATGTCCGC CGTGGTATGG TTGTCTGCGC TCCTGGCACC 600 

GTCAAGTCTC ATACCCAGTT TCTCGCTTCC CTCTACGTCC TCACCAAGGA 650 

GGAGGGTGGC CGACACACCG GTTTCCAGGA GCACTACCGA CCCCAGCTCT 700 

ATCTCCGAAC TGCAGATGAG TCCATTGACC TGACTTTCCC CGAGGGTACT 750 

GAGGATGCCT CCAGTAAGAT GGTCATGCCT GGTGACAACA CCGAGATGGT 800 

TGTCACCATG GGTCACCCCA ATGCCATCGA GGTTGGTCAG CGATTCAAC 84 9 



2 ) INFORMATION FOR SEQ ID NO: 503 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 1064 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: G186A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 

TGGTGCTATT GTCGTTGTTG CTGCTGCTGA CGGCCAAATG TAAGACGCCG 50 

CGAGGGACTG CTGAGGGTTT TATGCTTTTT AGGCCCCCTT GTTTCTGAGA 100 

GCATGATGAT ACTAATATTC GGAAACGTAT CTATTAGGCC TCAAACACGT .150 

GAGCATTTGC TCCTTGCCCG ACAGGTCGGT GTCCAAAAGA TCGTCGTTTT 200 

CGTGAACAAA GTCGACGCCC TTGAGGACAA GGAGATGTTG GAGCTTGTCG 2 50 

AGTTAGAAAT GAGAGAGCTC TTAAACACCT ACGGCTTCGA GGGTGAAGAG 3 00 

ACACCCATCA TCTTTGGTTC TGCCCTTTGC GCCATGGAAG GCCGTGAGCC 350 

TGAGTTGGGA GAAAAGAAAA TTGATGAATT GCTGGAGGCT GTTGATACTT 4 00 
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10 



15 



GGATCCCCAC 
GAGGAAGTAT 
TGAGCGCGGT 
GCTCCACCCC 
TCCTGTGACG 
TATCAAGCGT 
GCGTCAAGGC 
GCTGAGGGTG 
GTTCATCCGC 
TTCTTTTTTT 
GCCTGATTCA 
ATCTCAGCTT 
GACAACGTCG 
CGGCCAGCGA 



ACCACAACGT 
TCTCTATCTC 
GTCCTCAAGA 
CATCAGGACG 
AGTCTAGAGC 
GAAGATATCC 
CCACGACAAG 
GTCGCCGAAC 
ACAGCTGGTA 
CCTTTTCCTC 
CGAAATTAAC 
CCCTAGTGGA 
AGATGATCCT 
TTCA 



GATACCGAAA 
CGGCCGTGGA 
AGGATTCAGA 
AAGGTAACTG 
TGGGGACAAC 
GCCGTGGTAT 
TTCTTGGTGT 
CGGATTCGGC 
TGTCAAAATG 
TCTATCTCTC 
TAACCCGTTT 
GCAGATGAAA 
CCAGACACAC 



AACCTTTCTT 
ACCGTTGCCT 
AGTCGAGCTA 
ATATCGAAAC 
TCCGGTCTTT 
GGTAGTTGCC 
CGATGTATGT 
CAGAACTATC 
GGACCCCTTT 
TTTCTGTTTC 
GATTATAGAC 
GCAAACTCGT 
CGCCCCGTGG 



GATGTCCGTTl 4'5© 

CCGGTCGTGT 500 

ATTGGGGGCG 550 

TTTCAAGAAA 600 

TATTGCGTGG 65 0 

GTTCCTGGCA 700 

CCTGACCGAA 750 

GTCCTCAAAT 800 

TCATAATCCT 850 

CTTTCAACTC 900 

GAAGCCGCCC 950 

TATGCCTGGT 1000 

CTGCTGAGGC 1050 
1064 



2) INFORMATION FOR SEQ ID NO: 504 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 982 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 



35 



40 



45 



50 



55 



TGGTGCTATC 
GCAAGACTGG 
TTGGTATCAG 
GGTGTTCAGA 
TAAGGAGATG 
CCTATGGGTT 
TGTGCTATGG 
ATTACTCGAG 
ACAAGCCCTT 
GGAACCGTTG 
CGAAGTTGAA 
CTGATATCGA 
AACTCCGGCC 
TATGGTCGTT 
TGTCGATGTA 
GGTCAGAACT 
TCTTTCACTA 
GACACAGACG 
CAAAATGGTC 
GCCCGGGGCC 



GTCGTTGTTG 
TGAAAAAATC 
GCCCCAAACA 
AAATCGTTGT 
TTGGAGCTTG 
CGAGGGTGAG 
AGGGCCGTCA 
GCTGTGGATA 
CCTGATGTCC 
CCTCCGGCCG 
ATTATTGGCG 
AACCTTCAAG 
TCTTGTTGCG 
GCAGTTCCCG 
CGTTCTGACC 
ATCGTCCTCA 
TATTCCTATA 
AGGCTGCTGA 
ATGCCCGGTG 
GCTGAGGCTG 



CTGCCTCTGA 
TAAAGAAAAT 
CGCGAGCATC 
TTTCGTCAAC 
TCGAATTGGA 
AAGACACCTA 
GCCCGAGTTG 
CTTGGATCCC 
ATTGAGGAGG 
CGTTGAGCGT 
GCGGTGTTCC 
AAGTCTTGCG 
CGGTGTCAAG 
GAAGCGTCAA 
GAGGCTGAGG 
AATGTTCATC 
TGCATAGCCC 
ACTCAGCTGG 
ACAATGTTGA 
GACAGCGATT 



CGGC CAAATG 
AGAAAAGATT 
TGCTTCTTGC 
AAGGTCGATG 
GATGAGAGAG 
TCATCTTTGG 
GGAGAGCAGA 
TACGCCACAG 
TGTTCTCTAT 
GGTATCCTCA 
CACAATCCTG 
ACGAGTCCAG 
CGTGAGGATA 
AGCACATGAC 
GTGGTCGCCG 
CGCACAGCTG 
GATCCTCCCA 
CCTGATGGAG 
AATGGTCCTG 
CA 



TAGGGATTTT 50 

GTGCTGATGT 10 0 

CAGACAAGTC 15 0 

CTGTAGAGGA 200 

CTCTTGACCA 250 

TTCTGCGCTC 300 

AAATTGATGA 350 

CGTGATACTG 4 00 

CTCTGGACGA 450 

AGAAGGACTC 500 

ACCAAGGTGA 550 

AGCCGGGGAC 600 

TCCGCCGTGG 65 0 

AGATTCTTGG 700 

CACTGGCTTC 750 

GTACGTTCAT 80 0 

TTAACTAATT 850 

ACGACGAAGC 900 

AAGTCACACC 950 

982 



2) INFORMATION FOR SEQ ID NO: 505 



60 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 931 bases 



310 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffel 

(B) STRAIN: ATCC 58 950 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

15 CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 3 00 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 350 

2 0 TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 4 00 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

25 CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

3 0 CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 



35 2) INFORMATION FOR SEQ ID NO: 506 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 182 05 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 



TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCTCAAACCA 5 0 

GAGAACATTT RTTATTGGCT AGACAAGTTG GTGTTCAACA CATTGTTGTC 100 

TTTGTTAACA AAGTTGATAC TATTGATGAC CCAGAAATGT TGGAATTAGT 150 

TGAAATGGAA ATGAGAGAAT TGTTAAGTAC TTATGGTTTT GATGGTGATA 2 00 

55 ACGTCCCAGT TGTTATGGGT TCTGCTTTAT GTGCCTTGGA AGGTCGTGAA 2 50 

GAAGAAATTG GTGTCAAAGC TATTGATAAA TTATTAGCTG CTGTTGATGA 300 

ATATATCCCA ACCCCACAAA GAGATTTAGA AAAACCATTC TTGATGGGTG 350 

TTGAAGATGT CTTYTCAATC TCAGGTAGAG GTACCGTTGT TACTGGTCGT 4 00 

GTTGAACGTG GTAACTTGAA GAAAGGTGAT GAAGTTGAAA TTGTTGGTTT 450 

60 AAACAAAACT CCATTGAAAA CTACTGTYAC NGGTATTGAA ATGTTCAAAA 500 

311 
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AAGAATTGGA 
GGTATCAAAA 
TACCATCTCW 
AAGAAGAAGG 
TTATTCATCA 
TGGTGATTCA 
GTGAATTGGT 



CCAAGCTATG 
GAGATGACAT 
GCTCACACTA 
TGGTCGTCAC 
GAACTGGTGA 
TCTCAACAAA 
TCACCCAACT 



GCTGGTGATA 
YAAAAGAGGT 
AATTCTTAGC 
TCAGGTTTTG 
TGTTACCGTT 
TCTTACCAGG 
GCTTTAGAAG 



ACTGTGGTAT 
ATGGTTATTG 
CTCAATGTAT 
GTGAACATTA 
GTTTTAACCT 
TGACAATGTC 
CTGGTCAAAG 



CTTATTACGT 550 

CTAAAAC CGG 600 

ATTTTGACTA 650 

CAGACCTCAA 700 

TYCCAGAAGG 750 

GAAATGGTTT 800 

ATTCAA 846 



10 



15 



20 



25 



30 



35 



40 



45 



2 ) INFORMATION FOR SEQ ID NO: 507 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 964 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507 



GGAGCTGTTG 
CCGCCCAGAC 
CAGACCAGAG 
GGTCGTTTTC 
AGCTTGTCGA 
GGTGAGGAGA 
CCGTCGACCT 
TCGACACCTG 
ATGTCCATTG 
CGGTCGTGTT 
TTGGTGGCTC 
TTCAAGAAGT 
TCTCCGAGGT 
CCCCCGGATC 
CTGACTGAGG 
CCCTCAAATG 
TTACTAAGTA 
CATCTTTCAG 
GACAACGTCG 
TGGCCAACGA 



TCGTTGTCGC 
GGATGAAAGG 
AACATTTGCT 
GTTAACAAGG 
ACTTGAAATG 
CCCCCATCAT 
GAGCTTGGTG 
GATCCCCACC 
AGGAAGTGTT 
GAGCGTGGTA 
TACCACCCCT 
CCTGCGATGA 
ATCAAGCGTG 
CACCAAGGCT 
CTGAGGGTGG 
TTCATCCGTA 
GATCATTGCT 
CTGGCCTGGA 
AGATGATTTG 
TTCA 



AGCTTCTGAC 
ATTTGACGTT 
CCTTGCCCGC 
TCGATGCCGT 
CGTGAACTCC 
TTTTGGCTCT 
TCGAGAAGAT 
CCCGAGCGCG 
CTCTATCTCT 
TCCTCAAGAA 
ATCAAGACCA 
ATCTCGAGCT 
AGGACTTGAA 
CACACCGACT 
TCGTTCCAAC 
CTGCTGGTAT 
AACTTGTATT 
GAAGACCAAG 
CAAAACCCTC 



GGTCAAATGT 
TCTAACATCA 
CAGGTCGGTG 
TGAGGACCCA 
TCAGCCACTA 
GCTCTCTGTG 
TGACGAGCTA 
CCACTGATAA 
GGTCGTGGTA 
GGATTCCGAC 
AGGTCACAGA 
GGTGACAACT 
GCGTGGAATG 
TCATGGTCTC 
GGCTTCACCC 
GTAACCAAAG 
CCCTTCCGTA 
ACAAGAAGGC 
CACCCCATTG 



AATTGAATGC 50 

GTCTAGGCCT 100 

TCCAGAAGCT 150 

GAGATGTTGG 2 00 

CAGTTTTGAG 250 

CCCTCGAGTC 3 00 

TTGAACGCCG 350 

GCCTTTCCTC 400 

CCGTCGTCTC 450 

GTCGAAATTG 500 

TATCGAAACC 550 

CTGGTCTCCT 600 

GTTGTTGCTG 650 

CCTCTACGTC 700 

ACAAGTACCG 750 

TTTCCGCTAT 800 

GACGAAGCCG 850 

TATGCCTGGT 900 

CTGCCGAGGC 950 

964 



50 2 ) INFORMATION FOR SEQ ID NO: 508 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

312 
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(A) ORGANISM: Yarrowia lipolytics 

(B) STRAIN: ATCC 3 8295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508 

5 

GGTGCTATCA TTGTCGTTGC TGCTGGAGAC GGTTCCAAGC CCCAGACCCG 50 

AGAGCATCTG CTGCTTGCTC GACAGGTCGG TGTCCAGAAC CTGGTTGTGT 100 

TTGTTAACAA GGTTGATCAG ATTGATGATA AGGAGATTCT TGAGCTCGTT 150 

GACATGGAGA TGCGAGATCT GCTGACCCAG TACGGTTTTG ATGGTGACAA 200 

10 CACCCCCGTT GTCATGGGCT CTGCTCTGTG CGCTCTTGAG GGCAAGCAGA 250 

AGGATATTGG AGAGGACGCC ATCATGGCCC TTATGGATGC CGTTGATGAG 3 00 

CACATCCCTA CCCCTAACCG TGACCTTGAG AAGCCCTTCC TGATGCCCGT 350 

TGAGGACGTT TTCTCCATCT CTGGCCGAGG AACTGTTGTT ACTGGCCGAG 400 

TCGAGCGAGG AAACCTGAAG AAGGGTGAGG AAATCGAGAT TGTTGGCTAC 450 

15 AACAACAAGC CCATCAAGGC TGTTGTTACC GGTATTGAGA TGTTCAAGAA 500 

GGAGCTCGAG TCCGCCATGG CCGGTGACAA CGCCGGTATC CTGCTCCGAG 550 

GTATCAAGCG AGACGAGATC AAGCGAGGTA TGGTCATGTG CAAGCCTGGC 600 

ACCGTCAACG CCCACACCAA GTTCCTTGCT TCTCTTTACA TCATCCCCAC 650 

CGAGGAGGGT GGTCGAACCA GCTCTTTCGG CGCCAACTAC CGACCCCAGA 700 

20. TGTTCATCCG AACTTCTTCC GTCACCGCCA CTCTCACCTT CCCCGAGGGT 750 

ACCGACGAGT CCCAGACCGT CAACCCCGGT GACAACACTG AGATGGTTCT 8 00 

CGAGCTTGTT CACCCTACCG CCATTGAGGT CAACCAGCGA TTCA 844 



25 

2) INFORMATION FOR SEQ ID NO: 509 

( i ) SEQUENCE CHARACTER I STI CS : 
(A) LENGTH: 1067 bases 
30 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509 

CTTGGACAAG CTGAAGAGCG AGCGTGAGCG TGGTATCACC ATTGACATTA 50 

CCCTGTGGAA GTTCGAAACT GGCAAGTACT ACTACACCGT CATTGACGCC 100 

CCCGGTCACC GTGACTTCAT TAAGAACATG ATTACGGGTA CCTCCCAGGC 150 

45 CGATGTTGCT ATGCTTGTCG TGCCCGCCGA GGCTGGTGGT TTCGAAGCTG 200 

CCTTCTCTAA GGAAGGTCAG ACCCGTGAGC ACGCTCTTTT GGCCTTCACC 250 

CTTGGTGTCA AGCAGATCAT TTGCGCCATC AACAAGATGG ACAAGTGCGA 3 00 

CTACAAGGAG GACCGTTACA GCGAAATCCA GAAGGAAGTT CAGGGTTACC 3 50 

TGAAGAAGGT CGGTTACAAC ATCGAGAAGG TGCCTTTCGT CGCCATCTCC 400 

50 GGTTTCATGG GTGACAACAT GGTTGAGCGC TCCACCAACA TGCCGTGGTA 450 

CAAGGGCAAG ACCTTGGTCG AGGCCCTCGA CATGATGGAG CCCCCGAAGA 500 

GGCCCGTCGA CAAGCCCCTG CGTCTTCCCC TCCAGGGTGT GTACAAGATC 550 

GGTGGTATCG GTACCGTCCC TGTCGGTCGT GTGGAGACTG GTCAGCTCAA 60 0 

GGCCGGTATG GTCCTCACCT TCGCCCCCAA CCCGATCACT ACTGAGTGCA 650 

55 AATCCGTCGA AATGCACCAC GAAGTTATCG ATGTTGCCAG CCCTGGTGAC 700 

AACGTTGGTT TCAACGTGAA GAACGTGTCC ACCTCTGACA TCCGCACTGG 750 

TCACGTCGCT TCTGACTCCA AGAACGACCC CGCCAAGGCC GCCGTGTCCT 800 

TCACCGCCCA GGTCATCATC TTGAACCACC CTGGTACCAT CAAGGCCGGT 850 

TACTCCCCTG TGGTTGACTG CCACACTGCC CACATCTCGT GCAAATTCGA 900 

60 CGAGATCACC AGCCGTATGG ACAAGCGTAC CGGTAAGGCC CTTGAGGAGA 950 

313 
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ACCCCAAGAC CATCAAGAAC GGCGACGCCG CTATGGTCGT CCTGAAGCCG 
TGCAAGCCCA TGGTCGTCGA GGCCTTCACT GAATACGCTC CCCTTGGTCG 
TTTCGCCGTG CGTGACG 



1000 
1050 
1067 



2 ) INFORMATION FOR SEQ ID NO : 510 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis . 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510 

GTGAACGTGA ACGTGGTATT ACTATTGATA TTACCTTATG GAAGTTCGAG 50 

ACCACCAAGT ACTACTACAC CGTCATTGAT GCCCCTGGTC ACCGTGACTT 100 

CATCAAGAAC ATGATTACGG GTACTTCTCA AGCCGATGTT GCTATGCTTG 150 

TTGTACCAGC TGAGGCTGGT GGTTTCGAGG CCGCTTTCTC CAAGGAAGGA 2 00 

CAGACCCGTG AGCACGCTCT TTTGGCTTTC ACCCTTGGTG TCAAACAGAT 250 

CAT CTGTGC C ATTAACAAGA TGGACAAGTG CGACTACAAG GAGGACCGTT 3 00 

ACAGTGAAAT CCAGAAGGAA GTCCAGGGTT ACCTCAAGAA GGTCGGTTAC 3 50 

AATATTGAGA AGGTGCCCTT CGTTGCCATC TCCGGTTTCA TGGGAGACAA 4 00 

CATGGTTGAG CGTTCCACCA ACATGCCCTG GTATAAGGGA AAGACATTGG 450 

TCGAGGCCCT TGATCAGATG GAACCCCCAA AGAGGCCCGT TGACAAGCCA 500 

CTTCGTCTTC CCCTCCAGGG TGTCTACAAG ATCGGTGGTA TCGGTACCGT 550 

CCCCGTCGGT CGTGTTGAAA CTGGTATGTT GAAGGCTGGT ATGATTCTAA 600 

CCTTTGCTCC TAACCCA&TC ACCACTGAAT GCAAATCCGT TGAAATGCAC 650 

CACGAAACCG TTGAGGTTGC TTACCCCGGT GACAACGTCG GTTTCAACGT 700 
AAAGAACGTT TCTACTTCTG ACATTCGCAG TGGTCACGTT GCCTCTGATT % 750 

CTAAGAACGA CCCTGCCAAG GCTGCTGTTT CCTTCACTGC CCAGGTCATT 800 

GTGCTCAACC ACCCTGGTAC CATTAAGGCC GGTTACTGCC CCGTCGTCGA 850 

TTGCCACACC GCTCACATTT CATGTAAATT CGAAGAGATC ACCAGCCGTA 9 00 

TGGACAAGCG TACCGGTAAA TCTCTTGAGG AAAACCCCAA GACCATCAAG 950 

AACGGTGACG CTGCCATGGT TGTGCTCAAG CCAATGAAGC CCATGGTTGT 1000 

CGAATCCTTC ACTGAGTATG CTCCTCTTGG TCGTTTCGCT GTTCGTGAC 1049 



2) INFORMATION FOR SEQ ID NO : 511 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Crithidia fasciculata 

(B) STRAIN: ATCC 11745 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511 



TGGACAAGCT 
CTGTGGAAGT 
5 CGGCCACCGC 
ATGCCGCCAT 
ATCTCCAAGG 
GGGCGTGAAG 
TGCAGTACGC 

10 TACCTGAAGC 
CTCGGGCTGG 
GGTACAAGGG 
GTGCGTCCGG 
GATCGGCGGT 

15 TGAAGCCGGG 
GTGAAGTCGA 
CGACAACGTG 
GTGGTAACGT 
GACTTCACTG 

20 CGGCTACGCG 
TCGCGGACAT 
AAGAGCCCGA 
CCCGCAGAAG 
GCCGCTTCGC 

25 



GAAGGCGGAG 
TCGAGTCGCC 
GACTTCATCA 
TCTGATGATT 
ACGGCCAGAC 
CAGATGGTTG 
CCAGGCCCGC 
GCGTGGGCTA 
CAGGGCGACA 
TCCCACGCTG 
TGGACAAGCC 
AT CGGCACTG 
CGACGTTGTG 
TCGAGATGCA 
GGCTTCAACG 
GTGCGGCAAC 
CCCAGGTGAT 
CCGGTGCTGG 
CGAGTCCAAG 
AGGCCATCAA 
CCGATGTGCG 
TGTCCGCGAT 



CGCGAGCGCG 
CAAGTCCGTG 
AGAACATGAT 
GACTCGACCC 
CCGCGAGCAC 
TGTGCTGCAA 
TACGAGGAGA 
CAACCCGGAG 
ACATGATCGA 
CTGGAGGCGC 
GCTGCGCCTG 
TGCCCGTGGG 
GTGTTCGCGC 
CCACGAGCAG 
TGAAGAACGT 
ACGAAGAGCG 
CGTGCTGAAC 
ACTGCCACAC 
ATCGACCGCC 
GTCCGGCGAT 
TGGAGGTGTT 



GTATCACGAT 
TTCACGATCA 
CACCGGCACC 
AGGGTGGCTT 
GCCCTGCTTG 
CAAGATGGAC 
TCAGCAAGGA 
AAGGTGCGCT 
GAAGTCCGAC 
TCGACCTGCT 
CCCCTGCAGG 
CCGTGTGGAG 
CTGCCAACGT 
CTGGCTGAGG 
GTCCGTGAAG 
ACCCCCCGAA 
CACCCCGGCC 
GAGCCACATC 
GCTCTGGCAA 
GCGGCCATCG 
CAACGACTAC 
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CGATATTGCC 50 

TCGATGCCCC 100 

TCCCAGGCCG 150 

CGAGGCTGGC 2 00 

CCTTCACGCT 250 

GACAAGACGG 3 00 

GGTCGGCGCG 350 

TCATCCCGAT 4 00 

AACATGTCGT 450 

GGAGGCCCCC 500 

ACGTGTACAA 550 

ACCGGCGTGA 600 

GACGACCGAG 65 0 

CCGTGCCCGG 700 

GATATC CGCC 750 

GGAGGCGGCC 8 00 

AGATCAGCAA 850 

GCGTGCAAGT 900 

GGAGCTGGAG 950 

TGAAGATGAT 1000 

CCGCCGCTGG 1050 
1070 



2 ) INFORMATION FOR SEQ ID NO: 512 

30 ( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 
40 (B) STRAIN: HM1-IMMS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 

GCTGAAAGAG AAAGAGGAAT TACTATTGAT ATTTCATTAT GGAAATTCGA 50 

45 AACATCTAAA TACTACTTCA CTATTATTGA TGCCCCAGGT CAYAGAGATT 100 

TCATTAAGAA CATGATTACT GGAACTTCAC AAGCTGATGT TGCCATCCTT 150 

ATTGTTGCTG CTGGTACTGG WGAATTTGAA GCTGGTATTT CAAAGAATGG 200 

ACAAACCAGA GAACAYATTC TTCTTTCATA CACTCTTGGA GTTAAACAAA 250 

TGATTGTTGG WGTTAACAAG ATGGATGCTA TTCAATATAA ACAAGAAAGA 300 

50 TATGAAGAAA TTAAGAAAGA AATTAGTGCA TTCCTTAAGA AGACWGGATA 350 

TAATCCAGAC AAGATTCCAT TTGTCCCAAT TTCAGGATTC CAAGGAGATA 400 

ATATGATTGA ACCATCAACC AACATGCCAT GGTACAAAGG ACCAACATTA 4 50 

ATTGGAGCAC TTGATTCAGT CACACCACCA GAAAGACCAG TTGATAAACC 500 

ACTTAGACTT CCACTTCAAG ATGTTTAYAA GATTTCAGGT ATTGGAACTG 550 

55 TACCATGTGG AAGAGTTGAA ACTGGAGTTC TTAGACCAGG AACTATTGTT 600 

CAATTTGCAC CATCAGGAGT TTCATCTGAA TGTAAATCAG TTGAAATGCA 650 

TCACACAGCA CTTGCTCAAG CTATTCCAGG TGATAATGTT GGATTCAATG 700 

TTAGAAAYTT AACAGTTAAA GATATTAAGA GAGGAAATGT AGCATCAGAT 750 

GCTAAGAATC AACCAGCTGT TGGATGTGAA GATTTCACTG CTCAAGTCAT 800 

60 TGTCATGAAC CATCCAGGAC AAATTAGAAA GGGATATACA CCAGTTCTTG 850 

315 
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ATTGCCATAC ATCACACATT GCATGTAAAT 

ATTGATAGAA GAACAGGTAA ATCCATGGAA 

TAAGAATGGA GATTCAGCAC TTGTTAAGAT 

GTGTTGAAGA ATTTGCTAAA TTCCCACCAT 
5 GA 



PCT/CA00/01150 

TCGAAGAATT ATTA^GCAAG 909^ 
GGAGGAGAAC CAGAA^FATAT &50 
TGTTCCAACT AAACCACTTT 1000 
TGGGAAGATT TGCTGTTAGA 1050 

1052 



2) INFORMATION FOR SEQ ID NO; 513 



10 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 82 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Giardia lamblia 

(B) STRAIN: Faubert-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513 

25 GACGAGCGCG AGCGCGGGAT CACGATCAAC ATCGCGCTCT GGAAGTTCGA 50 

GACGAAGAAG TACATCGTCA CGATCATCGA CGCCCCGGGC CACCGCGACT 100 

TCATCAAGAA CATGATCACG GGGACGTCCC AGGCCGACGT CGCGATCCTC 150 

GTCGTCGCGG CGGGCCAGGG CGAGTTCGAG GCCGGGATCT CGAAGGACGG 2 00 

CCAGACGCGC GAGCACGCGA CCCTTGCGAA CACGCTCGGG ATCAAGACGA 2 50 

30 TGATCATCTG CGTCAACAAG ATGGACGACG GCCAGGTCAA GTACTCGAAG 3 00 

GAGCGCTACG ACGAGATCAA GGGCGAGATG ATGAAGCAGC TCAAGAACAT 3 50 

CGGCTGGAAG AAGGCCGAGG AGTTCGACTA CATCCCGACG TCCGGCTGGA 4 00 

CCGGGGACAA CATCATGGAG AAGTCCGACA AGATGCCCTG GTACGAGGGC 450 

CCGTGCCTGA TCGACGCGAT CGACGGGCTC AAGGCCCCGA AGCGCCCGAC 500 

3 5 CGACAAGCCC CTCCGCCTCC CGATCCAGGA CGTCTACAAG ATCTCGGGCG 550 

TCGGGACCGT CCCCGCGGGC CGCGTCGAGA CGGGCGAGCT CGCGCCCGGG 600 

ATGAAGGTCG TCTTCGCCCC GACGTCCCAG GTCTCGGAGG TCAAGTCCGT 650 

CGAGATGCAC CACGAGGAGC TCAAGAAGGC CGGGCCCGGG GACAACGTCG 700 

GCTTCAACGT CCGCGGGCTC GCCGTCAAGG ACCTCAAGAA GGGCTACGTC 750 

40 GTCGGGGACG TGACGAACGA CCCGCCCGTC GGCTGCAAGA GCTTCACCGC 800 

CCAGGTCATC GTCATGAACC ACCCGAAGAA GATCCAGCCC GGCTACACGC 850 

CCGTCATCGA CTGCCACACC GCGCACATCG CGTGCCAGTT CCAGCTCTTC 900 

CTCCAGAAGC TCGACAAGCG CACGCTCAAG CCCGAGATGG AGAACCCGCC 950 

CGACGCAGGC CGCGGCGATT GCATCATCGT CAAGATGGTC CCCCAGAAGC 1000 

45 CCCTGTGCTG CGAGACGTTC AACGACTACG CGCCCCTCGG CCGCTTCGCC 1050 

GTCCGCGACA TGCGCCAAAC CGTTGCCGTC GG 1082 



50 2) INFORMATION FOR SEQ ID NO: 514 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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WO 01/23604 PCT/CA00/01 150 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 

5 

ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG CGGCATCACG 50 

ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCCG TGTTCACGAT 100 

CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG ATCACGGGCA 150 

CGTCGCAGGC GGACGCCGCC ATCCTGATGA TCGACTCGAC GCATGGTGGC 2 00 

10 TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACCCGCGAGC ACGCGCTGCT 250 

TGCCTTCACT CTTGGCGTGA AGCAGATGGT GGTGTGCTGC AACAAGATGG 3 00 

ACGACAAGAC GGTGACGTAC GCGCAGTCGC GCTACGATGA GATCAGCAAG 3 50 

GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG AGAAGGTGCG 4 00 

CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC GAGAAGTCGG 4 50 

15 ACAACATGCC GTGGTACAAG GGTCCCACGC TGCTGGACGC GCTCGACATG 500 

CTGGAGCCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC TGCCCCTGCA 550 

GGACGTGTAC AAGATCGGCG GTATCGGGAC GGTGCCCGTG GGGCGCGTGG 600 

AGACCGGCAT CATGAAGCCG GGCGACGTGG TGACGTTCGC GCCCGCCAAC 650 

GTGACGACTG AGGTGAAGTC GATCGAGATG CACCACGAGC AGCTGGCGGA 700 

20 GGCGCAGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC GTGTCGGTGA 750 

AGGACATCCG CCGTGGTAAC GTGTGCGGCA ACTCGAAGAA CGACCCGCCG 8 00 

AAGGAGGCGG CCGACTTCAC GGCGCAGGTG ATCGTGCTGA ACCACCCCGG 850 

CCAGATCAGC AACGGCTACG CGC CGGTGCT GGACTGCCAC ACGAGCCACA 9 00 

TTGCGTGCCG CTTCGCGGAA ATCGAGTCCA AGATCGACCG CCGCTCCGGC 950 

25 AAGGAGCTGG AGAAGAACCC CAAGGCGATC AAGTCTGGCG ATGCCGCGAT 1000 

CGTGAAGATG GTGC CGCAGA AGC CGATGTG CGTGGAGGTG TTCAACGACT 1050 

ACGCGCCGCT GGGCCGCTTT GCCGTGCGCG ACATGCGCCA AACCGTTG 1098 



30 



40 



2) INFORMATION FOR SEQ ID NO: 515 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1104 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

50 ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGAGG AGATCAGCAA 3 50 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

55 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

60 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCAC CACG AG CAGCTGGCGG 70 0 
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WO 01/23604 

AGGCGCAGCC CGGCGACAAC GTCGGCTTO? 
AAGGACATCC GCCGTGGCAA CGTGTGCGGC 
GAAGGAGGCG GCCGACTTCA CGGCGCAGGT 
GCCAGATCAG CAACGGCTAC GC&CCGGTGC 
5 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC 
CAAGGAGCTG GAGAAGAACC CCAAGGCGAT 
TCGTGAAGAT GGTGCCGCAG AAGCCGATGT 
TACGCGCCGC TGGGCCGCTT TGCCGTGCGC 
CGTC 



PCT/CA00/01150 

AcBtfdffitaK* cfSTGite'GSTG - -7<3d 

AACTCGAAGA ACGACCCGCC So6 

GATCGTGCTG AACCACCCCG 850 

TGGACTGCCA CACGAGCCAC 900 

AAGATCGACC GCCGCTCCGG 950 

CAAGTCTGGC GATGCCGCGA 1000 

GCGTGGAGGT GTTCAACGAC 1050 

GACATGCGCC AAACCGTTGC 1100 

1104 



2) INFORMATION FOR SEQ ID NO: 516 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
25 (B) STRAIN: ATCC 30815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

30 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 10 0 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 2 00 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 2 50 

TTGCCTTCAC KCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

35 GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 45 0 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 50 0 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 55 0 

40 AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGGCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

45 GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 90 0 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

50 TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 110 0 

CGTCGG 1106 



55 2) INFORMATION FOR SEQ ID NO : 517 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517 

10 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 2 00 

15 CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 2 50 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCG CGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

20 GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

25 AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

30 CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTG 1099 



35 



2) INFORMATION FOR SEQ ID NO: 518 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 1098 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

45 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania. infantum 

(B) STRAIN: MOU 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 
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GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 5QJb 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 55 0 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

5 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 75 0 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

10 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 100 0 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 10 50 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 1098 



15 



30 



2) INFORMATION FOR SEQ ID NO: 519 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 

(B) STRAIN : ATCC 50120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519 



CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGATGCGC 100 

3 5 CCGGCCACCG CGACTTCATC AAGAACATGA TCACCGGCAC GTCGCAGGCC 150 

GACGCCGCCA TCCTGATGAT CGACTCGACC CAGGGCGGCT TCGAGGCTGG 200 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTC GCCTTCACGC 250 

TTGGTGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

GTGCAGTACT CGCAGGCGCG CTACGAGGAG ATCAGCAAGG AGGTGGGCGC 350 

40 GTACCTGAAG CGCGTCGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 400 

TCTCGGGCTG GCAGGGCGAC AACATGATCG ACAAGTCGGA CAGCATGCCG 450 

TGGTACAAGG GGCCCACGCT GCTGGACGCG CTCGACATGC TGGAGGCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

45 ATGAAGCCTG GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGGTGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 8 00 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCTGGC CAGATCAGCA 850 

50 ACGGCTACGC GCCGGTGCTG GACTGCCACA CGAGCCACAT CGCGTGCCGC 900 

TTCGCGGATA TCGAGTCCAA GATCGACCGC CGCTCTGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCCGGCGA TGCGGCCATC GTGAAGATGG 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CCCGCCGCTG 1050 

GGGCGCTTCG CTGTGCGCGA C 1071 

55 

2) INFORMATION FOR SEQ ID NO: 520 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 
10 (B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520 

CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

15 GCTGTGGAAG TTCGAGTCGC CCAAGTCCGT GTTCACGATC ATCGATGCGC 100 

CCGGCCACCG CGACTTCATC AAGAACATGA TCACGGGCAC GTCGCAGGCG 15 0 

GACGCCGCCA TCCTGATGAT CGACTCGACG CATGGTGGCT TCGAGGCTGG 2 00 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTT GCCTTCACTC 250 

TTGGCGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

2 0 GTGACGTACG CGCAGTCGCG CTACGATGAG ATCAGCAAGG AGGTGGGCGC 350 

GTACCTGAAG CGCGTGGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 4 00 

TCTCGGGCTG GCAGGGCGAC AACATGATCG AGAAGTCGGA CAACATGCCG 45 0 

TGGTACAAGG GTCCCACGCT GCTGGACGCG CTCGACATGC TGGAGCCGCC 50 0 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

25 AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

ATGAAGCCGG GCGACGTGGT GACGTTCGCG CCCGCCAACG TGACGACTGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGCAGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGT CGGTGAA GGACATC CGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 8 00 

3 0 CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCCGGC CAGATCAGCA 850 

ACGGCTATGC GCCGGTGCTG GACTGCCACA CGAGCCACAT TGCGTGC CGC 90 0 

TTCGCGGAAA TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTGGA 95 0 

GAAGAACCCC AAGGCGATCA AGTCTGGCGA TGCCGCGATC GTGAAGATGG 100 0 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CGCGCCGCTG 1050 

35 GGCCGCTTTG CCGTGCGCGA C 1071 



2) INFORMATION FOR SEQ ID NO: 521 



40 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1114 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521 

55 TCGTTCAAGT ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG 50 

CGGTATCACG ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCGG 10 0 

TGTTCACGAT CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG 150 

ATCACCGGCA CGTCGCAGGC GGATGCTGCC ATTCTGATGA TCGATTCGAC 2 00 

GCAGGGTGGC TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACGCGCGAGC 250 

60 ACGCGCTGCT GGCCTTCACG CTGGGCGTGA AGCAGATGGT TGTGTGCTGC 3 00 
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AACAAGATGG ACGACAAGAC GGTGCAGTAC GCGCAGGCGC GCTACGAGGA 350/ 

GATCAGCAAG GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG 400 

AGAAGGTGCG CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC 450 

GAGAAGTCCG ACAACATGTC GTGGTACAAG GGTCCCACGC TGCTGGAGGC 500 

5 GCTGGACATG CTGGAGGCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC 550 

TGCCCCTGCA GGACGTGTAC AAGATCGGCG GCATTGGCAC GGTGCCGGTG 600 

GGC CGTGTGG AGACCGGCAT CATGAAGCCC GGCGACGTGG TGACGTTCGC 650 

GCCCGCCAAC GTGACGACGG AGGTGAAGTC GATCGAGATG CACCACGAGC 700 

AGCTGCAGGA GGCTGTGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC 750 

10 GTGTCGGTGA AGGACATCCG CCGTGGTAAC GTGTGTGGCA ACTCGAAGAA 800 

CGACCCGCCG AAGGAGGCGG CTGACTTCAC GGCGCAGGTG ATCGTGCTGA 850 

ACCACCCCGG CCAGATCAGC AACGGCTACG CGCCGGTGCT GGACTGCCAC 900 

ACCAGCCACA TCGCGTGCCG CTTCGCGGAC ATCGAGTCGA AGATCGACCG 950 

CCGCTCCGGC AAGGAGCTGG AGAAGAACCC CAAGTCCATC AAGTCCGGCG 1000 

15 ACGCCGCCAT CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG 1050 

TTCAACGACT ACCCGCCGCT GGGCCGCTTT GCGGTGCGCG ACATGCGCCA 1100 

AACCGTTGCC GTCG 1114 



20 
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2) INFORMATION FOR SEQ ID NO: 522 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1106 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

40 ACGTCGCAGG CGGACGCCGC CATTCTGATG ATCGACTCGA CGCATGGCGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGACGTA CGCGCAGTCR CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

45 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

50 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAT GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

55 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCGG 1106 



60 
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2) INFORMATION FOR SEQ ID NO: 523 



5 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 



(C) STRANDEDNESS : Double 



(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 

2 0 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 
TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 
ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 
CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 
TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 

25 GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 
GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 
GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 
GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 
GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 

30. AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 
GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 
CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 
AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 
AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 

35 GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 
GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 
ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 
CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 
TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 

40 TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 
CGTCG 



45 2) INFORMATION FOR SEQ ID NO: 524 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1105 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 



50 



(C) STRANDEDNESS: Double 



(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



55 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 



60 



323 
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TACGCGTGGG 
GATCGACATT 
TCATCGATGC 
ACGTCGCAGG 
5 CTTCGAGGCT 
TTGCCTTCAC 
GACGACAAGA 
GGAGGTGAGC 
GCTTCATCCC 

10 GACAACATGC 
GCTGGAGCCG 
AGGACGTGTA 
GAGACCGGGA 
CGTGACGACT 

15 AGGCGCAGCC 
AAGGACATCC 
GAAGGAGGCG 
GCCAGATCAG 
ATCGCGTGCC 

20 CAAGGAGCTG 
TCGTGAAGAT 
TACGCGCCGC 



TGCTCGACAA 
GCGCTGTGGA 
GCCCGGCCAC 
CGGACGCGGC 
GGCATCTCGA 
TCTTGGCGTG 
CGGTGATGTA 
GCGTACCTGA 
GATCTCGGGG 
CGTGGTACAA 
CCGGTGCGCC 
CAAGATCGGC 
TCATGAAGCC 
GAGGTGAAGT 
CGGCGACAAC 
GCCGTGGTAA 
GCCGACTTCA 
CAACGGCTAC 
GCTTCGCGGA 
GAGAAGAACC 
GGTGCCGCAG 
TGGGCCGCTT 



GCTGAAGGCG 
AGTTCGAGTC 
CGCGACTTCA 
CATCCTGATG 
AGGACGGCCA 
AAGCAGATGG 
CGCGCAGTCG 
AGCGCGTGGG 
TGGCAGGGCG 
GGGTCCCACG 
CGGTGGACAA 
GGTATCGGGA 
GGGCGACGTG 
CGATCGAGAT 
GTCGGCTTCA 
CGTGTGCGGC 
CGGCGCAGGT 
GCGCCGG1GC 
GATCGAGTCC 
CCAAGGCGAT 
AAGCCGATGT 
TGCCGTGCGC 



GAGCGCGAGC 
GCCGAAGTCC 
TCAAGAACAT 
ATCGACTCGA 
GACCCGCGAG 
TGGTGTGCTG 
CGCTACGATG 
CTACAACCCG 
ACAACATGAT 
CTGCTGGACG 
GCCGCTGCGC 
CGGTGCCCGT 
GTGACGTTCG 
GCACCACGAG 
ACGTGAAGAA 
AACTCGAAGA 
GATCGTGCTG 
TGGACTGCCA 
AAGATCGACC 
CAAGTCTGGC 
GCGTGGAGGT 
GACATGCGCC 



PCT/CAOO/01150 

GCGGCAT.CAC 50 

GTGTTCACGA 100 

GATCACGGGC 150 

CGCATGGTGG 200 

CACGCGCTGC 250 

CAACAAGATG 300 

AGATCAGCAA 350 

GAGAAGGTGC 400 

CGACAAGTCG 450 

CGCTCGACAT 500 

CTGCCCCTGC 550 

GGGCCGCGTG 600 

CGCCCGCCAA 650 

CAGCTGGCGG 700 

CGTGTCGGTG 750 

ACGACCCGCC 800 

AACCACCCCG 850 

CACGAGCCAC 900 

GCCGCTCCGG 950 

GACGCCGCGA 1000 

GTTCAACGAC 1050 

AAACCGTT 10 98 



25 



2 ) INFORMATION FOR SEQ ID NO: 525 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1081 bases 
30 (B) TYPE: Nucleic acid 

<C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: II WT 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525 

GCTGAAGGCG GAGCGCGAGC GCGGCATCAC GATCGACATT GCGCTGTGGA 50 

AGTTCGAGTC GCCCAAGTCG GTGTTCACGA TCATCGATGC GCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCGCAGG CGGACGCGGC 150 

45 CATCCTGATG ATCGACTCGA CGCACGGTGG GTTCGAGGCT GGCATCTCGA 200 

AGGACGGGCA GACGCGCGAG CACGCGCTGC TTGCCTTCAC TCTTGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGA CGGTGATGTA 300 

CGCGCAGTCG CGCTACGATG AGATCAGCAA GGAGGTGGGC GCGTACCTGA 350 

AGCGCGTGGG CTACAACCCG GAGAAGGTGC GCTTCATCCC GATCTCGGGC 4 00 

50 TGGCAGGGCG ACAACATGAT CGAGAAGTCG GACAACATGC CGTGGTACAA 450 

GGGTCCCACG CTGCTGGACG CGCTCGACAT GCTGGAGGCG CCGGTGCGCC 500 

CGGTGGACAA GCCGCTGCGC CTGCCCCTGC AGGACGTGTA CAAGATCGGC 550 

GGCATCGGCA CGGTGCCCGT GGGCCGCGTG GAGACCGGCA TCATGAAGCC 600 

GGGCGACGTG GTGACGTTCG CGCCCGCGAA CGTGACGACG GAGGTGAAGT 650 

55 CGATCGAGAT GCACCACGAG CAGCTGGCGG AGGCGCAGCC CGGCGACAAC 700 

GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG AAGGACATCC GCCGTGGGAA 750 

CGTGTGCGGT AACTCGAAGA ACGACCCGCC GAAGGAGGCC GCCGACTTCA 800 

CGGCGCAGGT GATCGTGCTG AACCACCCCG GCCAGATCAG CAACGGCTAC 850 

GCGCCGGTGC TGGACTGCCA CACGAGCCAC ATCGCGTGCC GGTTCGCGGA 900 

60 CATCGAGTCC AAGATTGACC GCCGCTCCGG CAAGGAGCTG GAGAAGAACC 950 

324 



SUBSTITUTE SHEET (RULE 26) 



20 



WO 01/23604 PCT/CA00/01150 

CCAAGGCGAT CAAGTCCGGC GATGC CGCGA TCGTGAAGAT GGTG(3cGCAG lOOof 
AAGCCGATGT GCGTGGAGAT GTTCAACGAC TACGCGCCGC TTGGCCGCTT 10^0 
TGCTGTGCGC GACATGCGCC AAACCGTTGC C 1081 



2) INFORMATION FOR SEQ ID NO: 526 



(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 ( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmajiia. tropica. 

(B) STRAIN: ATCC 5012 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526 



AAATACGCGT GGGTGCTCGA CAAGCTGAAG GCGGAGCGCG AGCGCGGCAT 50 

CACGATCGAC ATTGCGCTGT GGAAGTTCGA GTCGCCCAAG TCCGTGTTCA 100 

25 CGATCATCGA TGCGCCCGGC CACCGCGACT TCATCAAGAA CATGATCACG 150 

GGCACGTCGC AGGCGGACGC CGCCATCCTG ATGATCGACT CGACGCATGG 2 00 

TGGCTTCGAG GCTGGCATCT CGAAGGACGG CCAGACCCGC GAGCACGCGC 2 50 

TGCTTGCCTT CACKCTTGGC GTGAAGCAGA TGGTGGTGTG CTGCAACAAG 3 00 

ATGGACGACA AGACGGTGAC GTACGCGCAG TCGCGCTACG ATGAGATCAG 3 50 

30 CAAGGAGGTG GGCGCGTACC TGAAGCGCGT GGGCTACAAC CCGGAGAAGG 40 0 

TGCGCTTCAT CCCGATCTCG GGCTGGCAGG GCGACAACAT GATCGAGAAG 450 

TCGGACAACA TGCCGTGGTA CAAGGGTCCC ACGCTGCTGG ACGCGCTCGA 500 

CATGCTGGAG CCGCCGGTGC GCCCGGTGGA CAAGCCGCTG CGCCTGCCCC 550 

TGCAGGACGT GTACAAGATC GGCGGTATCG GGACGGTGCC CGTGGGGCGC 600 

35 GTGGAGACCG GCATCATGAA GCCGGGCGAC GTGGTGACGT TCGCGCCCGC 650 

CAACGTGACG ACTGAGGTGA AGTCGATCGA GATGCACCAC GAGCAGCTGG 700 

CGGAGGCGCA GCCCGGCGAC AACGTCGGCT TCAACGTGAA GAACGTGTCG 750 

GTGAAGGACA TCCGCCGTGG TAACGTGTGC GGCAACTCGA AGAACGACCC 8 00 

GCCGAAGGAG GCGGCCGACT TCACGGCGCA GGTGATCGTG CTGAACCACC 850 

40 CCGGCCAGAT CAGCAACGGC TACGCGCCGG TGCTGGACTG CCACACGAGC 900 

CACATTGCGT GCCGCTTCGC GGAAATCGAG TCCAAGATCG ACCGCCGCTC 950 

CGGCAAGGAG CTGGAGAAGA ACCCCAAGGC GATCAAGTCT GGCGATGCCG 100 0 

CGATCGTGAA GATGGTGCCG CAGAAGCCGA TGTGCGTGGA GGTGTTCAAC 1050 

GACTACGCGC CGCTGGGCCG CTTTGCCGTG CGCGACATGC GCCAAACCGT 1100 

45 TG . 1102 



50 



2) INFORMATION FOR SEQ ID NO: 52 7 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1105 bases 

(B) TYPE; Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Neospora caninum 

325 
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WO 01/23604 

(B) STRAIN: Suarez-4 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 

5 GGACRAACTT AAAGCTGAAC GTGAGCGTGG TATCACCATT GATATCTCCC 50 

TGTGGAAATT TGAGACCAGC AAGTACTATG TTACCATCAT TGATGCCCCA 100 

GGACACAGAG ACTTCATCAA AAACATGATT ACAGGCACAT CCCAGGCTGA 150 

CTGTGCTGTC CTGATTGTTG CTGCTGGTGT TGGTGAATTT GAAGCCGGTA 2 00 

TCTCCAAGAA CGGGC AG AC C CGTGAGCATG CCCTTNTGGC TTACACCCTG 250 

10 GGTGTGAAAC AACTAATTGT TGGCGTTAAC AAAAKGGATT CCACTGAGCC 30 0 

ACCCTATAGC CARAAGAGAT ACGARGAAAT TGTTAAGGAA GTCAGCMCCT 350 

AYNTTAAAAA AATTGGYTAC AACCCCGACA CAGTANCATT TGKGCCAATT 400 

TNTGGCTGGA ATGGTGACAA CATGCTGGAN CCAAGTGCTA ATATGCCATG 450 

GTTCAAGGGA TGGAAAGTCM CCCGTAAGGA CGGCAATGCC AGKGGAACCM 500 

15 CCCTGCTTGA AGCTYTGGAT TGCATTYTGC CACCAAYTTG CCCAACTGAC 550 

AAACCCTTGC GTTTGCCTYT CCAGGATGTC TATAAAATTG GKGGTATTGG 600 

TACTGTCCCT GTGGGTCGTG TGGAGACTGG TGTTCTCAAA CCTGGCATGG 650 

TGGTCACCTT TGCTCCAGTC AATGTAACAA CTGAAGTGAA GTCTGTAGAA 700 

ATGCACCATG AAGCATTGAG TGAAGCCCTT CCTGGGGACA ATGTGGGCTT 750 

20 CAATGTCAAG AACGTGTCTG TCAAAGATGT CCGTCGTGGC AATGTGGCTG 800 

GTGACAGCAA AAATGATC C A CCCATGGAAG CTGCTGGCTT CACAGCTCAG 850 

GTGATTATTT TGAACCATCC AGGCCAAATC AGTGCTGGAT ATGCACCTGT 900 

GCTGGATTGT CACACAGCTC ACATTGCTTG CAAGTTTGCT GAGCTGAAGG 950 

AGAAGATTGA TCGTCGTTCT GGGAAAAAGC TGGAAGATGG CCCTAAATTC 10 0 0 

25 TTGAAATCTG GTGACGCTGC CATCGTTGAT ATGGTTCCTG GCAAGCCCAT 10 50 

GTGTGT CGAG AGCTTCTCTG ATTATCCTCC CCTGGGCCGT TTTGCTGTGC 1100 

GTGAC 1105 



30 



40 



2 ) INFORMATION FOR SEQ ID NO: 52 8 



(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 935 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichomonas vaginalis 

(B) STRAIN: ATCC 30001 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO; 528 

GCACATCCCA GGCTGATGCT GCTATCCTTG TCATCGACTC CACACTCGGT 50 

GGYTTCGAAG CCGGTATCGC TGAACAAGGC CAGACACGTG AACACGCTCT 100 

TCTTGCCTTC ACACTCGGCA TCAAGCAGGT CATTGTCGCC GTCAACAAGA 150 

50 TGGATGACAA GACAGTCAAC TACAACAAGG CYCGTTTCGA CGAAATCACA 2 00 

GCCGAAATGA CACGCATCCT TACAGGCATC GGCTACAAGC CAGAAATGTT 250 

CCGCTTCGTC CCAATCTCCG GCTGGGCTGG CGACAACATG ACAGAGAAGT 300 

CTCCAAACAT GCCATGGTAC AATGGCCCAT ACCTTCTTGA AGCCCTCGAT 350 

TCCCTTCAGC CACCAAAGCG CCCATTCGAC AAGCCACTCC GTCTTCCACT 400 

55 CCAGGATGTC TACAAGATCA ACGGTATCGG TACAGTTCCA GTCGGCCGTG 450 

TCGAATCCGG CACAATGAAG CCAGGCATGA TCGTTAACTT CGCCCCATCC 500 

ACAGTTACAG CTGAAGTTAA GTCCATCGAA ATGCACCACG AATCCCTTCC 550 

AGAGGCTCTT CCAGGTGACA ACATCGGCTT CAACGTCAAG AACGTTTCCA 600 

CAGCTGATGT CAAGCGTGGC TACGTCGTTG GTGATACAAA GCGTGACCCA 650 

60 CCAGTCGAAT GCGCTTCCTT CACAGCTCAR ATGATCATCT CCAACCACCC 700 
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AGGCAAGATC CACGCCGGCT 
ACATCGCCTG CAAGTTCGAC 
GGCAAGAAGG CTACAGAGAA 
TATCGTCGAG GTTGTCCCAT 
5 AGTACCCACC ACTCGGCCGT 



ACCAGCCAGT TTTCGACTGC 
AAGCTCATCC AGCGTATCGA 
CCCAGAATAC ATTCAGAAGG 
CCAAGCCACT CGTCGTCGAG 
TTCGCCATCC GTGAT 



PCT/CAOO/01150 

GA.CAOAGCTC 750 ) 

TCGTCGGCAC 800/ 
ATGATGCCGC 850 
TCCTTCCAGG 900 

935 



2 ) INFORMATION FOR SEQ ID NO: 529 



10 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1065 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Trypanosoma brucei subsp. brucei 

(B) STRAIN: EATR07 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529 

25 AAGCTGAAGG CTGAGCGCGA ACGTGGTATC ACGATCGACA TTGCACTGTG 50 

GAAATTCGAG TCACCCAAGT CTGTCTTCAC TATTATTGAT GCTCCTGGGC 100 

ACCGTGACTT CATCAAGAAC ATGATCACCG GCACATCGCA AGCCGACGCA 150 

GCCATCCTCA TCATTGCCTC TGCGCAGGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGA CAGACCCGCG AGCACGCGTT GCTGGCCTTC ACTTTGGGTG 250 

30 TGAAGCAGAT GGTTGTGTGC TGCAACAAGA TGGACGACAA GACTGTGAAC 3 00 

TACGGACAGG AGCGGTATGA CGAGATTGTG AAGGAGGTGT CTGCTTACAT 3 50 

CAAGAAGGTT GGGTACAACG TGGAGAAGGT GCGCTTCGTC CCCATCTCCG 400 

GATGGCAGGG CGACAACATG ATTGAGAAAT CCGAGAAGAT GCCATGGTAC 450 

AAGGGTCCAA CGCTCCTGGA GGCACTAGAC ATGCTGGAGC CACCAGTGCG 500 

35 TCCGAGCGAC AAGCCCCTGC GTCTGCCACT GCAGGACGTG TACAAGATCG 550 

GTGGTATTGG CACCGTGCCC GTTGGTCGTG TGGAGACCGG CGTGATGAAG 600 

CCTGGTGATG TGGTGACGTT TGCCCCCGCC AACGTGACGA CCGAGGTGAA 650 

ATCGATCGAG ATGCACCACG AGCAGCTCGC TGAGGCGACC CCCGGTGACA 700 

ACGTCGGCTT TAACGTGAAG AACGTTTCTG TAAAGGACAT CCGCCGTGGC 750 

40 AACGTCTGCG GTAACACCAA GAACGACCCC CCAAAGGAGG CCGCCGACTT 800 

CACGGCACAG GTGATCATCC TGAACCACCC CGGACAGATT GGAAACGGTT 850 

ATGCGCCCGT GCTGGACTGC CACACATCGC ACATTGCCTG CAAGTTCGCG 900 

GAGATCGAGT CGAAGATCGA CCGTCGCTCT GGCAAGGAGC TGGAGAAGGC 950 

TCCCAAGTCG ATCAAGTCTG GCGACGCCGC GATCGTGCGC ATGGTGCCGC 1000 

45 AGAAGCCTAT GTGCGTGGAG GTCTTCAACG ACTACGCGCC ACTCGGCCGC 1050 

TTTGCCGTGC GTGAC 1065 



50 2) INFORMATION FOR SEQ ID NO: 530 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE : Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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PCT/CA00/01150 



(A) ORGANISM: Crithiaia zascxcuiaza 

(B) STRAIN: ATCC 11745 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 0 

5 

TTCGCGGAGG GCGTGCCGCC GGTGCTGACG GCGCTGGACG 
CCTCGGCCGC GACGAGCCGC TGACGCTGGA GATTGTGCAG 
CGAACACCGG CCGCTGCATT GCCATGCAGA CGACGGATCT 
AAGTCGAAGG TTGTGTCGAC GGGCGGCAAC ATCTCCGTGC 

10 CGAGACGCTG GGCCGCATCT TCAACGTGCT CGGCGACGCG 
GCGGTGTGGT GGGCGAGAAG ATGCGCATGC CGATCCACGC 
AAGCTGGCGG ACCAGGCCGC GGAGGACGCG ATTCTGACGA 
GGTGATCGAC CTGATTCTGC CGTACTGCAA GGGTGGCAAG 
TCGGCGGTGC TGGTGTGGGC AAGACTGTGA TCATCATGGA 

15 AACGTGGCCA AGGGCCACGG TGGTTTCTCC GTGTTCGCCG 
GCGCACCCGC GAGGGCACGG ATCTGTACCT GGAGATGATG 
TCATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTGGTGTA 
AACGAGCCCC CGGGTGCGCG TGCGCGTGTT GCGCAGTCTG 
GGCGGAGTAC TTCCGTGACG TGGAGGGCCA GAACGTGCTG 

20 ACAACATCTT CCGCTTCACC CAGGCCAACT CCGAGGTGTC 
GGCCGCATTC CCGCCGCCGT GGGCTACCAG CCGACGCTTG 
TGGTATGCTG CAGGAGCGCA TTACGTCGAC GACGAAGGGC 
CTGTGCAGGC CGTGTACGTG CCGGCCGATG ATATCACGGA 
GCGACGACCT TCTCGCACCT GGATGCGACG ACGGTGCTGG 

25 TGCCGAGTCT GGCATCTACC CCGCCGTGAA CCCGCTGGAG 
GTATCATGGA CCC CGATGTG ATCGACGTGG ACCACTACAA 
GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG 
CGTGCTGGGT ATCGACGAGC TGAGCGAGGA GGACAAGCTT 
GCGCTCGCAA GGTGACGCGC TTCCTGTCGC AGCCGTTCCA 

30 GTGTTCACCG GCATGACGGG CCACTACGTG CAGCTGGAGG 
GTCGTTCTCT GGC CTGCTGA TGGGCTCGTA CGACCAGATC 



TGACGGAGGA 50 

CACTTGGACG 100 

GCTGAAGCTG 150 

CGGTTGGCCG 200 

ATCGACCAGC 250 

CGAGGCGCCG 300 

CCGGCATCAA 350 

ATCGGGCTGT 400 

GCTGATCAAC 450 

GCGTTGGCGA 500 

CAGTCGAAGG 550 

CGGCCAGATG 600 

CGCTGACGAT 650 

CTGTTCATCG 700 

CGCCCTGCTG 7 50 

CCGAGGATCT 800 

TCGATTACGT 850 

TCCGGCGCCG 900 

ACCGCGCGGT 950 

TGCGCGTCGC 1000 

CGTTGCGCAG 1050 

ATATCATTGC 1100 

GTGGTGGACC 1150 

GGTGGCCGAG 12 00 

ACACAGTGGA 12 50 

CCGGAGA 1297 



35 2) INFORMATION FOR SEQ ID NO: 531 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30816 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531 



CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACGG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

55 GTGAGACGCT GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

60 CAACGTCGCG AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

328 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



AGCGCACGCG 
GTGATTGACC 
GAACGAGCCC 
TGGCGGAGTA 
5 GACAACATCT 
GGGCCGCATT 
TTGGTATGCT 
TCCGTGCAGG 
CGCGACGACG 

10 TGGCGGAGTC 
CGTATCATGG 
GGATATCGTG 
CGGTGCTTGG 
CGCGCGCGCA 

15 GGTGTTCACG 
AGTCGTTCTC 



CGAGGGCACG 
TGAAGGGCGA 
CCGGGTGCGC 
CTTCCGCGAC 
TCCGCTTCAC 
CCGGCCGCCG 
GCAGGAGCGC 
CCGTGTACGT 
TTCTCGCACC 
GGGCATCTAC 
ACCCTGATGT 
CAGATGCTGA 
CATCGACGAG 
AGGTGACCCG 
GGCATGACGG 
TGGCCTGCTG 



GACCTGTACC 
GTCGAAGTGY 
GCGCGCGCGT 
GTGGAGGGCC 
GCAGGCGAAC 
TGGGCTACCA 
ATCACGTCGA 
GCCAGCGGAT 
TGGACGCGAC 
CCTGCCGTGA 
GATCGATGTG 
CCAAGTACAA 
CTGAGCGAGG 
GTTCCTGTCG 
GCCACTACGT 
ATGGGGTCGT 



TGGAGATGAT 
GTGCTTGTGT 
TGCGCAGTCT 
AGAACGTGCT 
TCCGAGGTGT 
GCCGACGCTT 
CAACGAAGGG 
GATATCACGG 
GACTGTGCTG 
ACCCGCTGGA 
GACCACTACA 
GGAGCTGCAG 
AAGACAAGGT 
CAGCCGTTCC 
GCAGCTGGTC 
ACGACCAGAT 



GCAGT&3AAG S^jl 
ATGGGCAGAT 600 
GCGCTGACGA 650 
GCTGTTCATC 700 
CTGCGCTGCT 750 
GCGGAGGATC 800 
GTCGAT CACG 850 
ATCCCGCGCC 900 
GACCGCGCGG 950 

GTGCGCGTCG 10 00 

ACGTTGCGCA 1050 

GATATCATTG 1100 

TGTTGTGGAC 1150 

AGGTTGCGGA 12 00 

GACACGGTGG 1250 

CCCGGAGA 1298 



20 2) INFORMATION FOR SEQ ID NO: 532 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 



TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 
CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACTTGGACG 100 
CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GCTGAAGCTG 150 
AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 2 00 

40 TGAGACGCTG GGCCGCATCT TCAACGTTCT GGGCGACGCG ATCGACCAGC 250 
GCGGCCCCGT GGGCGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCA 300 
AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 
GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 4 00 

TCGGCGGTGC CGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450 

45 AACGTCGCGA AGGGCCACGG TGGTTTCTCC GTGTTTGCCG GCGTTGGCGA 500 
GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 
TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 
AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 
GGCGGAGTAC TTCCGCGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

50 ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 
GGCCGCATTC CAGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 
TGGTATGCTG CAGGAGCGCA TCACGTCGAC AACGAAGGGG TCGATCACGT 850 
CCGTGCAGGC CGTGTACGTG CCAGCGGATG ATATCACGGA TCCCGCGCCC 900 
GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

55 GGCGGAGTCG GGCATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGACGTG ATCGATGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGC ATCGACGAGC TGAGCGAGGA AGACAAGGTT GTTGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 1200 

60 GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGTCG ACACGGTGGA 1250 
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GTCGTTCTCT GGCCTGCTGA TGGGGTCGTA CGACCAGATC CCGG^QA 1297. 



5 2) INFORMATION FOR SEQ ID NO: 533 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533 



CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 2 00 

25 GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 4 00 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

30 CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

35 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

40 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

45 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



50 2) INFORMATION FOR SEQ ID NO: 534 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Leishmania Infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534 

5 

CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

10 GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

AAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

15 CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

20 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACTG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

25 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GAC C ACT ACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 12 00 

30 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 12 98 



35 2) INFORMATION FOR SEQ ID NO: 535 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1301 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii.) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 



GCACTTCTCG GAGGGCGTGC CGCCCGTGCT GACGGCGCTG GATGTGACGG 50 

AGGACCTTGG CCGCGATGAG CCGCTGACGC TGGAGATCGT GCAGCACTTG 100 

GACGCGAACA CCGGCCGCTG CATTGCGATG CAGACGACGG ACCTGCTGAA 150 

GCTGAAGTCG AAGGTTGTGT CGACCGGTGG CAACATCTCT GTGCCGGTGG 200 

55 GCCGTGAGAC GCTGGGCCGC ATCTTCAACG TTCTGGGCGA TGCGATCGAC 250 

CAGCGCGGCC CCGTGGGCGA GAAGATGCGC ATGGCGATCC ACGCCGAGGC 300 

CCCGAAGCTG GCGGATCAGG CCGCGGAGGA CACGATCCTG ACGACCGGCA 3 50 

TCAAGGTGAT CGACCTGATT CTGCCCTACT GCAAGGGTGG CAAGATCGGY 400 

CTGTTCGGCG GTGCCGGTGT GGGCAAGACT GTGATCATCA TGGAGCTGAT 450 

60 CAACAACGTC GCGAAGGGCC ACGGTGGTTT CTCCGTGTTT GCCGGCGTTG 500 
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GCGAGCGCAC GCGCGAGGGC ACGGACCTGT 
AAGGTGATTG ACCTGAAGGG CGAGTCGAAG 
GATGAACGAG CCCCCGGGTG CGCGCGCGCG 
CGATGGCGGA GTACTTCCGC GACGTGGAGG 
5 ATCGACAACA TCTTCCGCTT CACGCAGGCG 
GCTGGGCCGC ATTCCGGCCG CCGTGGGCTA 
ATCTTGGTAT GCTGCAGGAG CGCATCACGT 
ACGTCCGTGC AGGCCGTGTA CGTGCCAGCG 
GCCCGCGACG ACGTTCTCGC ACCTTGACGC 

10 CGGTGGCGGA GTCGGGCATC TACCCTGCCG 
TCGCGTATCA TGGACCCCGA TGTGATCGAT 
GCAGGATATC GTGCAGATGC TGACCAAGTA 
TTGCGGTGCT TGGCATCGAC GAGCTGAGCG 
GACCGCGCGC GCAAGGTGAC CCGGTTCCTG 

15 GGAGGTGTTC ACGGGCATGA CGGGCCACTA 
TGGAGTCGTT CTCTGGCTTG CTGATGGGGT 
A 



PCT/CAOO/01150 

ACCTGGAGAT GATGCAGTCG 55T) 

TGCGTGCTTG TGTACGGGCA 60Qs 

CGTTGCGCAG TCTGCGCTGA 650 

GCCAGAACGT GCTGCTGTTC 700 

AACTCCGAGG TGTCCGCGCT 750 

CCAGCCGACG CTTGCGGAGG 800 

CGACAACGAA GGGGTCGATC 850 

GATGATATCA CGGATCCCGC 900 

GACGACTGTG CTGGACCGCG 950 

TGAACCCGCT GGAGTGCGCG 1000 

GTGGACCACT ACAACGTTGC 1050 

CAAGGAGCTG CAGGACATCA 1100 

AGGAAGACAA GGTTGTGGTG 1150 

TCGCAGCCGT TCCAGGTTGC 12 00 

CGTGCAGCTG GTCGACACGG 1250 

CGTACGACCA GATCCCGGAG 130 0 

1301 



20 



30 



2) INFORMATION FOR SEQ ID NO : 536 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 1298 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 6 

CTTCGCGGAG GGCGTGCCGC CGGTGCTGAC GTCGCTGGAT GTGACGGAGA 50 

ACCTCGGCCG CGATGAGCCG CTGACGCTGG AGATTGTGCA GCACTTGGAC 10 0 

GCGAACACCG GTCGCTGGAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

40 GAAGTCGAAG GTCGTGTCGA CCGGTGGCAA CATCTCTGTG CCTGTTGGCC 200 

GCGAGACGCT GGGTCGCATC TTCAACGTGC TTGGCGATGC GATTGACCAG 250 

CGCGGCCCTG TGGGTGAGAA GATGCGCATG GCGATCCACG CCGAGGCGCC 300 

GAAGCTGGCG GATCAGGCGG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA TCTTATTCTG CCGTACTGCA AGGGTGGTAA GATCGGTCTG 400 

45 TTCGGTGGTG CCGGTGTAGG CAAGACTGTG ATTATTATGG AGCTGATCAA 450 

TAACGTGGCG AAGGGCCACG GTGGGTTTTC CGTQTTTGCT GGCGTGGGCG 500 

AGCGCACGCG CGAGGGCACT GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA ATCAAAGTGC GTGCTTGTGT ACGGACAGAT 600 

GAACGAGCCC CCGGGTGCGC GTGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

50 TGGCCGAGTA CTTCCGCGAT GTGGAGGGCC AGAACGTGCT GCTGTTCATT 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGTCGCATT CCTGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TGGGCATGCT GCAGGAGCGC ATTACGTCGA CGACGAAGGG CTCGATTACG 850 

TCTGTGCAGG CCGTGTACGT GCCTGCGGAT GATATCACGG ACCCGGCGCC 900 

55. CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGTGCGG 950 

TGGCAGAGTC GGGCATTTAC CCTGCGGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CCGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

60 CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 
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GGTGTTCACT GGCATGACGG GTCACTACGT TCAGCTGGAG GACAdGGTGG 1251 
AGTCGTTCTC TGGCCTACTG ATGGGGTCAT ACGACCAGAT CCCGGAGA 12 9^ 



2) INFORMATION FOR SEQ ID NO: 53 7 

( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 1297 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 

CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 
ACCTTGGCCG TGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 
GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

25 GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 
GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGATGC GATCGACCAG 250 
CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

GAAGCTGGCG GATCAGGCCG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 
AGGTGATCGA CTTGATCCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400 

30 TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 
CAATGTCGCG AAGGGCCACG GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 
AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 
GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 
GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

35 TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 
GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CCGCGCTGCT 750 
GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 
TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 850 
TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

40 CGCGACGACG TTCTCGCACC TGGATGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GACATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTGGTGGAC 1150 

45 CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAG 1297 



50 

2) INFORMATION FOR SEQ ID NO: 538 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1297 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538 

TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACCTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GTTGAAGCTG 150 

10 AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 2 00 

TGAGACGCTG GGCCGCATCT TCAACGTGCT GGGCGACGCG ATCGACCAGC 2 50 

GCGGCCCCGT GGGTGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCG 300 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 4 00 

15 TTGGTGGCGC CGGTGTGGGC AAGACCGTGA TCATCATGGA GTTGATTAAC 450 

AACGTCGCGA AGGGCCACGG TGGTTTCTCG GTGTTTGCCG GCGTTGGCGA 500 

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 60 0 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

2 0 GGCGGAGTAC TTCCGAGACG TGGAGGGCCA GAATGTGCTG CTGTTCATCG 700 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

GGCCGCATTC CGGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC GACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCTGCGGATG ATATCACGGA TCCGGCGCCC 900 

25 GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGGATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGT ATCGACGAGC TGAGCGAGGA GGACAAGGTC GTGGTGGACC 1150 

3 0 GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 12 00 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGCCG ACACGGTGGA 12 50 

GTCGTTCTCT GGGCTGCTGA TGGGGTCGTA CGACCAGATC CCGGAGA 1297 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 53 9 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 9 
CCITACATCC TBGTYGCICT IAACAAG 27 

2) INFORMATION FOR SEQ ID NO: 540 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540 
GGDGCITCYT CRTCGWAITC CTG 



2) INFORMATION FOR SEQ ID NO: 541 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541 
GTKGAAATGT TCCGCAAGCT GCT 



2) INFORMATION FOR SEQ ID NO: 542 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542 
CGGAARTAGA ACTGSGGACG GTAG 



2) INFORMATION FOR SEQ ID NO: 543 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543 
ATCTTAGTAG TTTCTGCTGC TGA 



2 ) INFORMATION FOR SEQ ID NO: 544 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544 
AYGTTGTCGC CMGGCATTMC CAT 

10 

2) INFORMATION FOR SEQ ID NO: 545 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545 
TACATCCTBG TYGCICTIAA CAAGTG 2 6 

25 



2) INFORMATION FOR SEQ ID NO: 54 6 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) ' TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546 
40 CCRCGICCGG TRATGGTGAA GAT 23 
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23 



2) INFORMATION FOR SEQ ID NO: 547 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547 

55 

GTACAGTTGC TTCAGGACGT ATC 2 3 



60 2) INFORMATION FOR SEQ ID NO: 548 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548 
ACGTTCGATT TCATCACGTT G 



2) INFORMATION FOR SEQ ID NO: 549 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549 
GAACGTGATA CTGACAAACC TTTA 



2) INFORMATION FOR SEQ ID NO: 550 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550 
GAAGAAGAAC ACCAACGTTG 



2 ) INFORMATION FOR SEQ ID NO: 551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551 
GAAGAAAAAA TCTTCGAACT GGCTA 
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2) INFORMATION FOR SEQ ID NO: 552 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552 

15 

TACACGGCCG GTGACTACG 19 



20 2) INFORMATION FOR SEQ ID NO: 553 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553 

GGCCGTGTTG AACGTGGTCA AATCA 25 



35 



45 



50 



60 



2) INFORMATION FOR SEQ ID NO: 554 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554 
GTTCCTTACA TCGTTGTTTT TCTC 24 



2) INFORMATION FOR SEQ ID NO: 555 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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40 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 
TCTCGAACTT TCTCTATGTA TGCA 24 

2) INFORMATION FOR SEQ ID NO: 556 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556 

20 

CGGCGCNATC YTSGTTGTTG C 21 

25 2 ) INFORMATION FOR SEQ ID NO: 557 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557 

CCMAGGCATR ACCATCTCGG TG 22 



2 ) INFORMATION FOR SEQ ID NO: 558 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558 
TCI TTYAART AYGC I TGGGT 20 

2) INFORMATION FOR SEQ ID NO: 559 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 
CCGACRGCRA YIGTYTGICK CAT 23 



2) INFORMATION FOR SEQ ID NO: 560 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560 
25 GAYTTCATYA ARAAYATGAT YAC 23 



2) INFORMATION FOR SEQ ID NO: 561 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561 

40 

ACIGTICGGC CRCCCTCACG GAT 23 



45 2) INFORMATION FOR SEQ ID NO: 562 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562 

CARATGRAYG ARCCICCIGG IGYIMGIATG 30 
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2) INFORMATION FOR SEQ ID NO: 563 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563 
GGYTGRTAIC CIACIGCIGA IGGCAT 2 6 



2) INFORMATION FOR SEQ ID NO: 564 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 2 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564 
TAYGGI CARA TGAAYGARCC ICCIGGIAA 29 

30 

2) INFORMATION FOR SEQ ID NO: 565 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565 
4 5 GGYTGRTAI C C I AC I GC I GA I GGDAT 2 6 



2) INFORMATION FOR SEQ ID NO: 566 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566 

60 
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2) INFORMATION FOR SEQ ID NO: 567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 567 
TCRTC I GC I G G I ACRTAI AY I GCYTG 



2) INFORMATION FOR SEQ ID NO: 568 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568 
RTIATIGGIG CIGTIRTIGA YGT 



2) INFORMATION FOR SEQ ID NO: 569 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569 
RTIRTIGGIS CIGTIRTIGA TAT 



2) INFORMATION FOR SEQ ID NO: 570 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570 
5 RTIRYIGGIC CIGTIRTIGA YGT 23 
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2 ) INFORMATION FOR SEQ ID NO : 571 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571 

20 

RTIRTIGGIC CIGTIRTIGA TGT 23 

25 ' 2) INFORMATION FOR SEQ ID NO: 572 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572 

RTIRTIGGIS CIGTIRTIGA 20 



2) INFORMATION FOR SEQ ID NO: 573 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573 
CCICCIACCA TRTARAAIGC 2 0 

2) INFORMATION FOR SEQ ID NO: 574 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574 
AT I GC I ATGG AYGGIACIGA RGG 23 



2) INFORMATION FOR SEQ ID NO: 575 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575 
25 TIACCATTTC AGTACCTTCT GGTAA 25 



2 ) INFORMATION FOR SEQ ID NO: 576 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576 

40 

AACTTCRTCA AGAAGGTYGG TTACAA 26 

45 2 ) INFORMATION FOR SEQ ID NO: 577 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577 

CATGATTGAA CCATCCACCA 2 0 
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2 ) INFORMATION FOR SEQ ID NO: 578 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578 
CATGATTGAA GCTTCCACCA 20 



2 ) INFORMATION FOR SEQ ID NO: 5 79 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579 
GAAGGCCGTG CTGGTGAGAA 2 0 

30 



2 ) INFORMATION FOR SEQ ID NO: 580 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580 
45 GCTAAACCAG CTACAATCAC TCCAC 25 



50 



2) INFORMATION FOR SEQ ID NO: 581 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581 

60 
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5 2) INFORMATION FOR SEQ ID NO: 582 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582 

TTTCAACTTC GTCGTTGACA CGAACAGT 28 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 583 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583 
CAACTGCTTT TTGGATATCT TCTTTAATAC CAACG 35 

2) INFORMATION FOR SEQ ID NO: 584 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584 
ACATGACACA TCTAAAACAA 20 

50 



2) INFORMATION FOR SEQ ID NO: 585 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 
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<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585 
5 ACCACATACT GAATTCAAAG 2 0 
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2) INFORMATION FOR SEQ ID NO: 586 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586 

20 

CAGAAGTATA CGTATTATCA 20 

25 2) INFORMATION FOR SEQ ID NO: 587 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587 

CGTATTATCA AAAGACGAAG 20 



2) INFORMATION FOR SEQ ID NO : 588 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588 
TCTTCTCAAA CTATCGTCCA 20 

2) INFORMATION FOR SEQ ID NO : 589 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 20 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 9 
GCACGAAACT TCTAAAACAA 20 



2) INFORMATION FOR SEQ ID NO: 590 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590 
25 TATACGTATT ATCTAAAGAT 20 



30 



2) INFORMATION FOR SEQ ID NO: 591 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591 

40 

TCCTGGTTCT ATTACACCAC 20 



45 2) INFORMATION FOR SEQ ID NO: 592 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592 

CAAAGCTGAA GTATACGTAT 20 
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2) INFORMATION FOR SEQ ID NO: 593 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 93 
TTCACTAACT ATCGCCCACA 20 

2) INFORMATION FOR SEQ ID NO: 594 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594 
ATTGGTATCC ATGACACTTC 20 

30 

2 ) INFORMATION FOR SEQ ID NO: 595 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595 
45 TTAAAGCAGA CGTATACGTT 20 



2 ) INFORMATION FOR SEQ ID NO: 596 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY : . Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596 

60 
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2) INFORMATION FOR SEQ ID NO: 5 97 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 97 
ATTGGTAT C A AAGAAACTTC 



2) INFORMATION FOR SEQ ID NO : 598 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598 
AATTACACCT CACACAAAAT 



2) INFORMATION FOR SEQ ID NO: 599 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599 
CGGTGAAGAA ATCGAAATCA 



2) INFORMATION FOR SEQ ID NO: 60 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600 
ATGCAAGAAG AATCAAGCAA 



2) INFORMATION FOR SEQ ID NO: 6 01 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601 
GTTTCACGTG ATGATGTACA 



2) INFORMATION FOR SEQ ID NO: 602 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602 
AAGTTGAAGT TGTTGGTATT 



2) INFORMATION FOR SEQ ID NO: 603 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603 
GGTATTAAAG ACGAAACATC 



2) INFORMATION FOR SEQ ID NO: 604 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604 
GGTGATGAAG TAGAAATCGT 20 



2 ) INFORMATION FOR SEQ ID NO: 605 

15 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605 
25 GAAATGTTCC GTAAATTATT 20 



30 



2 ) INFORMATION FOR SEQ ID NO: 606 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606 

40 

ATTAGACTAC GCTGAAGCTG 20 



45 2 ) INFORMATION FOR SEQ ID NO: 607 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE : 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: ATCC 29212 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607 
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CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 56 

GTGAACATAT CTTATTATCA CGTAACGTTG GTGTACCATA CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCAGA ATACGATTTC CCAGGCGATG 200 

5 ATGTTCCAGT TATCGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGACGAG 2 50 

TCTTATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG ACGTGTTGAA 40 0 

CGTGGTGAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTAAAGA 450 

10 CGAAACATCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCAGGC GACAACTTCG GTGCTTTATT ACGTGGTGTA 550 

GCACGTGAAG ATATCGAACG TGGACAAGTA TTAGCTAAAC CAGCTACAAT 60 0 

CACTCCACAC ACAAAATTCA AAGCTGAAGT ATACGTATTA TCAAAAGAAG 65 0 

AAGGCGGACG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC 70 0 

15 TTCCGTACAA CAGACGTTAC TGGTGTTGTA GAATTGCCAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTTGCTAT GGACGTTGAA TTAATTCACC 800 

CAATCGCTAT CGAAGACGGA A 821 
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2) INFORMATION FOR SEQ ID NO : 6 08 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 751 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608 

CGGAGCTATC TTGGTAGTTT CTGCTGCTGA CGGCCCAATG CCTCAAACTC 50 

GTGAACACAT CCTATTGTCT CGTCAAGTTG GTGTTCCTTA CATCGTTGTA 10 0 

TTCTTGAACA AAGTAGACAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

4 0 TGAAATGGAA GTTCGTGACC TATTAACAGA ATACRAATTC CCTGGTGRCG 200 

ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA AAGCTCTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATTCT TGAAT TAATG GCTGCAGTTG ACGAATACAT 300 

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG 350 

ACGTGTTCTC AATTACTGGA CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

45 CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAGTTGTTG GTATTGCTGA 450 

AG AAACTT CA AAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACYACGC TGAAGCTGGA GACRACATTG GTGCTTTACT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACACCTCRT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 650 

50 AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

A 751 



2 ) INFORMATION FOR SEQ ID NO: 609 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 751 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinairum 

(B) STRAIN: ATCC 49573 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609 



CGGTGCGATC 
GTGAACACAT 
TTCTTGAACA 
TGAAATGGAA 
ATGTTCCTGT 
TCATACGAAG 
TCCAACTCCA 
ACGTATTCTC 
CGTGGACAAG 
CGAAACTGCT 
TAGACTATGC 
GCTCGTGAAG 
CACACCTCAT 
AAGGTGGRCG 
TTCCGTACAA 
A* 



TTAGTAGTAT 
CTTGTTATCA 
AAATGGATAT 
GTTCGTGACC 
AATCGCCGGT 
AAAAAATCAT 
GAACGTGATA 
AATCACTGGA 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGG 
ACATCCAACG 
ACAAAATTCA 
TCACACTCCA 
CTGACGTAAC 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGAYGAC 
TATTGTCTGA 
TCTGCTTTGA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACTG 
TGATGAAGTA 
TAACAGGTGT 
GATAACATTG 
TGGACAAGTA 
AAGCTGAAGT 
TTCTTCACTA 
TGGTGTTGTT 



CGGTCCTATG 
GCGWACCATA 
GAAGAATTGC 
ATATGACTTC 
AAGCTCTTGA 
GCTGCAGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTT C 
GTGCATTGCT 
TTGGCTAAAG 
TTATGTTTTG 
ACTACCGTCC 
GAATTAC C AG 



CCTCAAACTC 50 

CATCGTTGTT 100 

TAGAATTAGT 150 

CCAGGCGACG 2 00 

AGGAGATCCT 250 

ACGAATACGT 3 00 

CCAGTCGAAG 350 

CCGTGTTGAA 4 00 

GTATTGCTGA 450 

CGTAAATTGT 500 

ACGTGGGGTT 550 

CTGGTACRAT 600 

ACAAAAGAAG 65 0 

TCAGTTCTAC 70 0 

AAGGAACTGA 750 

751 



30 



40 



45 



4 



2) INFORMATION FOR SEQ ID NO: 610 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 891 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Haemophilus influenzae Rd 

(B) STRAIN: KW20 

(C) ACCESSION NUMBER: extracted from U3273 9 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610 



AATATGATTA CTGGTGCGGC ACAAATGGAT GGTGCTATTT TAGTAGTAGC 50 

AGCAACAGAT GGTCCTATGC CACAAACTCG TGAACACATC TTATTAGGTC 100 

50 GCCAAGTAGG TGTTCCATAC ATCATCGTAT TCTTAAACAA ATGCGACATG 150 

GTAGATGACG AAGAGTTATT AGAATTAGTC GAAATGGAAG TTCGTGAACT 200 

TCTATCTCAA TATGACTTCC CAGGTGACGA TACACCAATC GTACGTGGTT 250 

CAGCATTACA AGCGTTAAAC GGCGTAGCAG AATGGGAAGA AAAAATCCTT 3 00 

GAGTTAGCAA ACCACTTAGA TACTTACATC CCAGAACCAG AACGTGCGAT 3 50 

55 TGACCAACCG TTCCTTCTTC CAATCGAAGA TGTGTTCTCA ATCTCAGGTC 400 

GTGGTACTGT AGTAACAGGT CGTGTAGAAC GAGGTATTAT CCGTACAGGT 4 50 

GATGAAGTAG AAATCGTCGG TATCAAAGAT ACAGCGAAAA CTACTGTAAC 500 

GGGTGTTGAA ATGTTCCGTA AATTACTTGA CGAAGGTCGT GCAGGTGAAA 550 

ACATCGGTGC ATTATTACGT GGTACCAAAC GTGAAGAAAT CGAACGTGGT 600 

60 CAAGTATTAG CGAAACCAGG TTCAATCACA CCACACACTG ACTTCGAATC 650 

354 
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AGAAGTGTAC GTATTATCAA AAGATGAAGG 
TCAAAGGTTA CCGTCCACAA TTCTATTTCC 
ACAATCGAAT TACCAGAAGG CGTGGAAATG 
CAAGATGACA GTAAGCTTAA TCCACCCAAT 
5 GTTTCGCAAT CCGTGAAGGT GGCCGTACAG 



TGGTCGTCAT ACTCCA^TCT 70 0 

GTACAACAGA CGTGACTGGT 750- 

GTAATGCCAG GCGATAACAT 800 

TGCGATGGAT CAAGGTTTAC 850 

TAGGTGCAGG C 8 91 



2) INFORMATION FOR SEQ ID NO: 611 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611 

25 CGGCGGTATC TTAGTTGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 2 00 

ATGTACCTGT AATCGCTGGT TCTGCATTAA AAGCATTAGA AGGCGATGCT 2 50 

30 GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCAGTTG ATGATTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATGCACGA 4 50 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

35 ACTACGCTGA- AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCA 55 0 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCTATTAC 600 

ACCACACACA AAATTCAAAG CTGAAGTATA CGTATTATCT AAAGATGAAG 650 

GTGGACGTCA CACTCCATTC TTCACTAACT ATCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTAAAC TTACCAGAAG GTACAGAAAT 750 

40 GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 80 0 

TCGCTATCGA AGACGGAA 818 



45 2) INFORMATION FOR SEQ ID NO: 612 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Parathyphi A 

(B) STRAIN: ATCC 9150 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612 

355 
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GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAbACCCG 5/) 
TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATTGTGT 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

5 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 4 00 

10 GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

15 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTC 825 



20 



35 



55 



2) INFORMATION FOR SEQ ID NO: 613 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 

(B) STRAIN: ATCC 33105 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613 



GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GYCAGGTTGG CGTTCCTTTC ATCATCGTRT 100 

40 TCATGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAACT GCTGTCCGCT TACGACTTCC CTGGCGATGA 200 

CCTGCCGGTG ATTCGCGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG AMCMSCTGGA TACTTACATC 300 

CCAGAACCAG AGCGCGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

45 CGTATTCTCC ATCTCCGGTC GTGGTACCGT GGTTACCGGT CGTGTTGAGC 400 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGTCAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCCATCAAG 600 

50 CCGCACACCC AGTTCGATTC AGAAGTGTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACKCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACTACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCTG GCGACAACGT GAACATGA 778 



2) INFORMATION FOR SEQ ID NO: 614 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 653 bases 

356 
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WO 01/23604 PCT/CAOO/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614 



GTGCGATCTT AGTAGTATCA GCTACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACACATTT TGTTATCACG TCAAGTTGGT GTTAAGCACT TGATCGTTTT 100 

15 CTTGAACAAA GTAGATTTAG TTGATGACGA AGAATTGATC GACTTAGTTG 150 

AAATGGAAGT ACGTGAATTA CTTTCTGAAT ATGGTTTCCC AGGTGATGAT 200 

ATTCCAGTGC TTAAAGGTTC TGCTTTGAAA GCATTAGAAG GCGATCCAGA 250 

ACAAGAACAA GTTATTCTTG ATTTGATGGA TACCGTTGAT GAATATATCC 300 

CAACACCTGA ACGTGACAAT GACAAACCGT TCTTGTTACC AGTTGAGGAT 350 

20 GTTTTCTCGA TCACAGGACG TGGTACTGTA GCTTCTGGTC GTATCGACCG 400 

TGGCGAAGTT AAAGTCGGCG ATGAAATTGA AATCATCGGG ATCAAACCTG 450 

AAGTTCAAAA AGCAATCGTT ACTGGACTTG AAATGTTCCG TAAAACATTG 500 

GATTATGGTG AAGCTGGCGA TAACGTTGGG GTTCTATTAC GTGGGATTAC 550 

ACGTGATGAA ATCGAACGTG GCCAAGTATT AGCTAAACCA GGTTCAATCA 600 

25 CACCACATAC TAAGTTCAAA GCCGAAGTAT ATGTGTTGAC GAAAGAAGAA 650 

GGT 653 



30 2) INFORMATION FOR SEQ ID NO: 615 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE : 

(A) ORGANISM: Enterococcus durans 

(B) STRAIN: ATCC 19432 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615 



CCATTCTAGT TGTATCTGCA ACAGATGGAC CAATGCCACA AACACGTGAA 50 

CATATTTTAT TGTCACGTCA AGTAGGTGTT AAATATTTGA TCGTCTTCTT 100 

GAACAAAATC GACTTAGTAG ATGATGAAGA ATTGATTGAT CTTGTCGAAA 150 

TGGAAGTTCG TGAATTATTA AGCGAATATG GTTTCCCAGG TGACGATACA 200 

50 CCAGTCATCA AAGGTTCAGC ATTAAAAGCT TTACAAGGAG ATCCTGATGC 250 

AGAAGCAGCT ATCATGGAAT TGATGGATAC TGTTGATGAA TATATCCCAA 300 

CACCAGAACG TGATACAGAC AAACCATTAT TGTTACCAGT GGAAGATGTC 350 

TTCTCAATCA CAGGTCGTGG GACTGTTGCT TCAGGTCGTA TCGATCGTGG 400 

TGCAGTTCGT GTAGGTGATG AAATCGAAAT CGTCGGTATC AAACCTGAAA 450 

55 CACAAAAAGC TGTTGTAACT GGGGTCGAAA TGTTCCGCAA GACATTAGAC 500 

TATGGTGAAG CAGGAGATAA CGTTGGGGTA TTGTTACGTG GTATCCAACG 550 

TGAAGATATC GAACGTGGAC AAGTAATCGC AAAACCAGGT TCAATCACAC 600 

CACATACAAA ATTCAAAGCA GAAGTGTACG TATTGACAAA AGAAGAAGGT 650 

GGACGTCATA CACCATTCTT CAATAACTAT CGTCCACAAT TCTACTTCCG 700 

60 TACAACTGAC GTAACTGGAA CAATCGTTTT ACCTGGAGGC ACTGAAATGG 750 

357 
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TTATGCCTGG AGATAACGTA ACGATCGACG TTGAATTGAT CCATC\JAGTT 8QO 
GCCATCGAAA ACGGAACAAC TTTCTCTAT 82 9 



2) INFORMATION FOR SEQ ID NO: 616 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 669 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616 

GGTGCAATTT TAGTAGTATC TGCTACTGAT GGCCCAATGC CACAAACACG 50 

TGAACATATC TTGTTATCAC GTCAAGTAGG GGTTAAACAC TTAATCGTCT 100 

TCTTGAACAA AGTTGATTTA GTTGATGATG AAGAATTGAT CGATTTAGTT 150 

2 5 GAAATGGAAG TTCGGGAATT GCTTTCTGAA TATGGTTTCC CAGGCGATGA 2 00 

TATTCCAGTA CTTAAAGGTT CTGCTTTGAA AGCTTTAGAA GGCGATCCTG 250 

AACAAGAACA AGTAATCCTT GACTTGATGG ATACGGTTGA TGAATACATC 300 

CCAACGCCTG AACGTGATAC TGACAAACCA TTCTTGTTAC CAGTCGAAGA 350 

TGTCTTCTCA ATCACAGGAC GTGGTACGGT TGCATCTGGT CGTATCGATC 400 

30 GTGGGGAAGT TAAAGTCGGT GATGAAGTTG AAATCATCGG GATCAAACCT 450 

GAAGTGCAAA AAGCTGTCGT AACTGGACTA GAAATGTTCC GTAAGACATT 500 

GGATTACGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CTCGTGATGA AATCGAACGT GGACAAGTAT TAGCTAAACC AGGTTCAATC 600 

ACTCCACATA CGAAATTCAG TGCAGAAGTT TATGTATTGA CGAAAGAAGA 650 

35 AGGTGGCCGT CATACGCCA 669 



40 



2) INFORMATION FOR SEQ ID NO: 617 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 617 

55 CGGGGCAATT TTAGTTGTAT CTGCAACTGA TGGCCCAATG CCACAAACAC 50 

GTGAACACAT TTTGTTAGCT CGTCAAGTAG GGGTTAAATA TTTAATCGTC 100 

TTCTTGAACA AAACAGATTT AGTTGATGAT GAAGAATTAT TGGAACTAGT 150 

TGAAATGGAA GTTCGTGAAT TATTAAATGA ATACAATTTC CCTGGCGATG 200 

ATATTCCTGT TATTCGCGGA TCTGCTTTAA AAGCATTAGA AGGCGATCCA 250 

60 GAACAAGAAG AAGTAATTAT GAACTTGATG GATACTGTGG ATGAATATAT 3 00 

358 
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CCCAACTCCA GAACGTGACA 
ATGTCTTCAC AATTACTGGT 
CGTGGTAAAG TCAACGTTGG 
AGAAACACAA AAAGCTGTTG 
5 TGGATTATGG TGAAGCTGGT 
ACTCGTGATG AAGTAGAACG 
TACACCGCAT ACCAAATTTA 
AAGGTGGACG TCATACTCCT 
TTCCGTACAA CTGATGTGAC 
10 AATGGTAATG CCTGGTGATA 
CAATCGCCGT TGAAAAAGGG 



ATGATAAACC ATTCTTGTTA 
CGTGGTACTG TTGCTTCAGG 
TGATGAAATT GAAATTATCG 
TAACCGGTTT GGAAATGTTC 
GATAACGTTG GGGTCTTATT 
TGGTCAAGTA TTAGCAAAAC 
AAGGTGAAGT TTATATCTTA 
TTCTTTAATA ACTATCGTCC 
TGGTAACATC GCATTACCTG 
ATGTAACAAT TGAAGTTGAA 
ACTACTTTCT CAATT 



PCT/CAOO/01150 

CCAGTSqGAAG 3 5Q/ 

TCGTATCGAC 400 

GAATTAAACC 450 

CGTAAAACTT 500 

ACGTGGGATT 550 

CAGGTTCCAT 600 

ACAAAAGAAG 650 

TCAATTTTAT 700 

AAGGAACTGA 750 

TTGATTCATC 800 

835 



15 2) INFORMATION FOR SEQ ID NO: 618 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618 



GGTGCAATCC TAGTAGTATC AGCTACTGAT GGTCCAATGC CGCAAACACG 50 

TGAACATATT TTGCTATCAC GGCAAGTGGG TGTTAAACAC TTAATCGTAT 10 0 

TTTTAAACAA AGTTGATTTA GTCGATGATG AAGAATTGAT CGATCTAGTT 150 

GAAATGGAAG TCCGTGAATT ACTTTCTGAA TATGGTTTCC CAGGTGACGA 200 

35 TATTCCAGTT CTCAAAGGTT CAGCTTTGAA AGCATTAGAA GGCGATCCTG 250 

AACAAGAACA AGTAATCCTT GATTTAATGG ATACAGTTGA CGAATATATC 3 00 

CCAACTCCAG AACGTGACAC TGACAAGCCA TTCTTGTTAC CAGTCGAAGA 3 50 
TGTATTTTCT ATCACTGGTC GTGGGACTGT AGCGTCTGGA CGGATTGATC . 400 

GTGGTGAAGT TAAAGTCGGC GATGAAGTTG AAATCATCGG GATCAAACCT 450 

40 GAAATTCAAA AAGCAGTCGT AACTGGACTT GAAATGTTCC GTAAAACTTT 500 

AGATTATGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CACGTGATGA AATCGAACGT GGTCAAGTCT TAGCTAAACC AGGTTCAATC 600 

ACACCACATA CAAAATTCAG TGCAGAAGTT TACGTATTGA CGAAAGAAGA 650 

AGGTGGACGT CATACACCAT CTT 673 

45 



2) INFORMATION FOR SEQ ID NO: 619 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
60 (C) ACCESSION NUMBER: K00428 

359 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619 

TTTCATATTT TTAAGGATTT TGTTTTAGCA CCCATCCGAC CTCAGTCAAT 50 

5 ATATCCTTTC GCGACCAGGC TTTCCTCCCT TTTGCTGCTA ACTGGTTACA 100 

GATTTTCCTA TTTTTGGTCA TTTTTATCTT TGAAACTGAT TAAGCTGAAA 150 

AAATTTGAGC TTCTTTGTTG TAAACTATTT TGTGCTTTCA GTTTTATTCT 2 00 

AGCTCGACAA AGGTAACAGA CAAAAATGTC AGCTTTATTA CCAAGATTAC 250 

TCACAAGAAC AGCTTTTAAA GCTTCTGGGA AACTTCTGAG GCTCTCTTCA 300 

10 GTAATTTCTA GGACCTTTTC TCAAACTACT ACTTCCTATG CAGCTGCTTT 3 50 

TGATCGTTCC AAACCGCATG TAAATATAGG TACGATCGGC CATGTTGATC 4 00 

ATGGGAAGAC AACTTTAACC GCAGCCATTA CGAAAACGTT AGCCGCAAAA 450 

GGTGGTGCCA ACTTCTTGGA CTATGCTGCC ATCGATAAGG CTCCGGAAGA 500 

AAGAGCTCGT GGTATTACAA TTTCTACTGC ACACGTGGAA TACGAAACGG 550 

15 CCAAGAGACA TTATTCTCAC GTCGACTGTC CAGGCCACGC TGATTACATC 600 

AAGAATATGA TTACCGGTGC TGCTCAAATG GATGGTGCTA TCATTGTTGT 650 

AGCTGCTACC GATGGACAAA TGCCCCAAAC TAGAGAACAT TTACTTTTGG 700 

CCAGACAAGT TGGTGTCCAA CATATTGTCG TTTTTGTTAA CAAGGTTGAT 750 

ACCATTGATG ATCCAGAAAT GTTAGAGTTA GTCGAAATGG AAATGAGAGA 800 

20 ACTTTTAAAC GAATATGGGT TTGACGGTGA TAATGCTCCA ATTATCATGG 850 

GTTCTGCCCT TTGCGCTTTG GAAGGTCGCC AACCTGAAAT TGGGGAGCAG 900 

GCCATCATGA AACTTTTGGA TGCAGTGGAT GAGTATATTC CTACACCTGA 950 

AAGAGATTTG AACAAGCCTT TCTTGATGCC CGTTGAAGAT ATCTTCTCTA 1000 

TCTCCGGTAG AGGTACTGTG GTCACTGGTC GTGTGGAAAG GGGTAATTTA 1050 

25 AAGAAAGGTG AGGAATTGGA AATTGTTGGT CACAACTCCA CCCCATTGAA 1100 

AACAACAGTT ACTGGTATTG AAATGTTTAG AAAGGAATTG GACTCTGCTA 1150 

TGGCAGGTGA CAATGCCGGT GTTTTACTTA GAGGTATCAG GAGAGATCAA 12 00 

TTGAAGAGAG GTATGGTCTT AGCTAAGCCA GGTACCGTTA AAGCCCATAC 1250 

AAAGATTCTA GCCTCTTTGT ACATTTTATC CAAAGAGGAA GGTGGTAGAC 1300 

30 ATTCTGGGTT TGGTGAAAAC TACAGACCAC AAATGTTTAT AAGAACAGCT 1350 

GATGTTACAG TTGTGATGAG ATTTCCTAAG GAGGTTGAAG ATCATTCTAT 14 00 

GCAAGTTATG CCAGGTGACA ATGTTGAAAT GGAATGTGAT TTGATCCATC 1450 

CTACCCCATT AGAAGTTGGT CAACGTTTCA ATATCAGAGA GGGTGGAAGA 1500 

ACTGTTGGTA CCGGTCTAAT CACACGTATT ATTGAATAGA CTTATTGATG 1550 

35 CAA.CTGGAGT ATATTTCTAT ATATTCTGTT CATTTCCCCT CTCATAATAT 16 00 

ATACTTGTTT CGTTAAAATT TTATACGTGT AAATAAAGTG CCATAAATTT 1650 

TTCAGCTTTA CTTTTGGTAG AGTCCTGCTA GCACTAGATT TTACAATTTC 1700 

ATGTGCACAC ACC 1713 

40 

2) INFORMATION FOR SEQ ID NO: 620 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620 
ATTGGTGCAT TGCTACGT 18 

55 



2) INFORMATION FOR SEQ ID NO: 621 

60 (i) SEQUENCE CHARACTERISTICS: 

360 
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(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
10 (B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621 

TGGTG CAATC TTAGTTGTTT CTGCAACTGA CGGTCCGATG CCTCAAACAC 50 

15 GTGAACACAT TTTATTGTCA CGCCAAGTTG GTGTAAAATA CCTGATTGTT 100 

TTCTTGAACA AAGTTGATTT AGTCGATGAT GAAGAATTGA TCGATTTGGT 150 

AGAAATGGAA GTTCGCGAGT TATTGAGCGA ATATGGTTTC CCAGGCGATG 200 

ACACTCCTGT GATCAAAGGT TCCGCATTAA AAGCATTGCA AGGCGATCCA 250 

GATGCTGAAG CTGCTATTAT GGAATTGATG GATACAGTAG ATGAATATAT 3 00 

20 CCCAACACCA GAACGTGATA CAGATAAACC ATTACTATTG CCAGTGGAAG 3 50 

ACGTCTTCTC AATTACAGGT CGAGGAACTG TTGCCTCAGG TCGTATTGAT 4 00 

CGTGGTGCTG TTCGTGTCGG TGATGAGGTA GAGATCGTAG GGATCAAACC 450 

TGAAACACAA AAAGCAGTTG TAACAGGTGT AGAAATGTTC CGTAAAACGT 500 

TAGATTACGG GGAAGCTGGG GATAACGTAG GCGTGTTGTT ACGGGGGATC 550 

25 CAACGTGACG ATATCGAACG TGGACAAGTA CTTGCTAAAC CAGGTTCCAT 600 

TACTCCACAT ACAAAATTCA AAGCAGAAGT GTACGTGTTG ACAAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTATCGTCC ACAGTTCTAC 7 00 

TTCCGCACAA CTGATGTTAC AGGAACAATC ACATTGCCAG AAGATACAGA 750 

A 751 

30 



2) INFORMATION FOR SEQ ID NO: 622. 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
45 (B) STRAIN: ATCC 13264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622 

GTCAAATGGG ACGAATCCAG ATTCCAAGAA ATTGTCAAGG AAACCTCCAA 50 

50 CTTTATCAAG AAGGTTGGTT ACAACCCAAA GACTGTTCCA TTCGTCCCAA 10 0 

TCTCTGGTTG GAACGGTGAC AACATGATTG AAGCTACCAC CAACGCTCCA 150 

TGGTACAAGG GTTGGGAAAA GGAAACCAAG GCCGGTGTCG TCAAGGGTAA 2 00 

GACTTTGTTG GAAGCCATTG ACGCCATTGA ACAACCATCT AGAC CAACTG 250 

ACAAGCCATT GAGATTGCCA TTGCAAGATG TTTACAAGAT TGGTGGTATT 300 

55 GGTACTGTGC CAGTCGGTAG AGTTGAAACC GGTGTCATCA AGCCAGGTAT 350 

GGTTGTTACT TTCGCCCCAG CTGGTGTTAC CACTGAAGTC AAGTCCGTTG 400 

AAATGCATCA CGAACAATTG GAACAAGGTG TTCCAGGTGA CAACGTTGGT 450 

TTCAACGTCA AGAACGTTTC CGTTAAGGAA ATCAGAAGAG GTAACGTCTG 500 

TGGTGACGCT AAGAACGATC CACCAAAGGG TTGCGCTTCT TTCAACGCTA 550 

60 CCGTCATTGT TTTGAACCAT CCAGGTCAAA TCTCTGCTGG TTACTCTCCA 600 
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GTTTTGGATT GTCACACTGC TCACATTGCT TGTAGATTCG ACGAATTGTT 65/) 
GGAAAAGAAC GACAGAAGAT CTGGTAAGAA GTTGGAAGAC CATCCAAAGT 700 
TCTTGAAGTC CGGTGACGCT GCTTTGGTCA AGTTCGTTCC ATCTAAGCCA 750 



10 



15 



20 



2) INFORMATION FOR SEQ ID NO: 623 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623 



TCTTGAAAGC 
TCTTTGGAAG 

25 CCGGTCACCG 
GACTGTGCCA 
TATCTCCAAG 
TCGGTGTCAG 
TGGTCCGAGG 

30 CAAGAAGGTT 
GTTGGCACGG 
CTTCTACGTG 
ACAAGGGATG 
CTCCTCGAGG 

35 GCCCCTCCGT 
CAGTCCCTGT 
TCTCATCTCT 
ATTTTCAGGT 
TCAAGTCCGT 

40 GACAACGTTG 
AGGTAACGTC 
CTTTCAACGC 
GGTTACACCC 
ACCTCCAACA 

45 TTGCTGAGTT 
GCCGCCCCCA 
TGCCCAGAAG 
GTCGATTCGC 



TTAAGGCCGA 
TTCGAGACCC 
AGACTTCATC 
TCCTCATCAT 
GACGGTCAGA 
GCAGCTCATT 
ACCGATTCAA 
GGTTACAACC 
TGACAACATG 
ATGAGATGTT 
GACCAAGGAG 
CCATCGACGC 
CTCCCTCTCC 
CGGCCGAGTC 
CTTGTCTCGT 
ATGGTCGTCA 
TGAGATGCAC 
GTTTCAACGT 
TGTGGTGACT 
CCAGGTTATC 
CCGTTCTCGA 
TACCTTTGAA 
GATCGAGAAG 
AGTTCGTCAA 
CCCCTCTGTG 
CGTCCGAGA 



GCGAGAGCGA 
CCAGGTACCA 
AAGAACATGA 
TGCCACCGGT 
CCCGAGAGCA 
GTTGCTTGCA 
CGAAATCGTC 
CCAAGGCTGT 
TTGGAGGAGA 
TTTCTGACTT 
ACCAAGTCTG 
CATCGAGCCC 
AGGACGTCTA 
GAGACCGGTG 
AACATGCGTC 
AGTTCGCCCC 
CACGAGCAGA 
CAAGAACGTT 
CCAAGAACGA 
GTCCTTAACC 
CTGTCACACT 
GCTGACCCTT 
ATTGACCGAC 
GTCTGGTGAC 
TTGAGACCTA 



GGTATCACCA 
GGTCACCGTC 
TCACCGGTAC 
ATCGGTGAGT 
CGCCCTCCTC 
ACAAGATGGA 
AAGGAGACCA 
CCCCTTCGTC 
CCACCAAGTC 
TCCGTGCAGC 
GTGTTTCCAA 
CCTACCCGAC 
CAAGATCGGT 
TCATCAAGGC 
TCGTAACATG 
CACCAACGTC 
TCCCCGAGGG 
TCCATCAAGG 
CCCCCCTATG 
ACCCTGGTCA 
GCCCGTAAGC 
TCTAGACATT 
GAACCGGTAA 
GCCGCCATTG 
CGCCGACTAC 



TCGACATTGC 50 

ATTGACGCCC 100 

CTCCCAGGCT 150 

TCGAGGCCGG 2 00 

GCCTTCACCC 250 

CACCTGCAAG 300 

ACGGTTTCAT 350 

CCCATCTCTG 4 00 

AGTGAATCCG 450 

ATGCCCTGGT 500 

GGGTAAGACC 550 

CCACCGACAA 600 

GGTATCGGCA 650 

CGGTATGTTG 700 

CGCTTACTTC 750 

ACCACTGAAG 800 

TCTTCCCGGA 850 

ACATCCGACG 900 

GAGGCTGCTT 950 

GATCGGTGCC 10 00 

CTGACCCAAT 1050 

GCCTGCAAGT 1100 

GGTCATGGAG 1150 

TCAAGCTTGT 1200 

CCCCCTCTTG 1250 
1269 



2) INFORMATION FOR SEQ ID NO: 624 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 753 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 36801 

5 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 624 

TCTGTCAAAT GGGACAAAAA CAGATTTGAA GAAATCATCA AGGAAACCTC 50 

CAACTTCGTC AAGAAGGTTG GTTACAACCC AAAGACTGTT CCATTCGTTC 100 

10 CAATCTCTGG TTGGAATGGT GACAACATGA TTGAACCATC CACCAACTGT 150 

CCATGGTACA AGGGTTGGGA AAAGGAAACC AAATCCGGTA AAGTTACTGG 200 

TAAGACCTTG TTAGAAGCTA TTGACGCTAT TGAACCACCA ACCAGACCAA 2 50 

CCGACAAACC ATTGAGATTG CCATTGCAAG ATGTTTACAA GATCGGTGGT 3 00 

ATTGGTACTG TGCCAGTCGG TAGAGTTGAA ACTGGTATCA TCAAAGCCGG 3 50 

15 TATGGTTGTT ACTTTCGCCC CAGCTGGTGT TACCACTGAA GTCAAATCCG 4 00 

TTGAAATGCA TCACGAACAA TTGGCTGAAG GTGTTCCAGG TGACAATGTT 450 

GGTTTCAACG TTAAGAACGT TTCCGTTAAA GAAATTAGAA GAGGTAACGT 500 

TTGTGGTGAC TCCAAGAACG ATCCACCAAA GGGTTGTGAC TCTTTCAATG 550 

CCCAAGTCAT TGTTTTGAAC CATCCAGGTC AAATCTCTGC TGGTTACTCT 60 0 

2 0 CCAGTCTTGG ATTGTCACCC TGCCCACATT GCTTGTAAAT TCGACACTTT 650 

GGTTGAAAAG ATTGACAGAA GAACTGGTAA GAAATTGGAA GAAAATCCAA 700 

AATTCGT CAA ATCCGGTGAT GCTGCTATCG TCAAGATGGT CCCAACCAAA 750 

CCA 753 



25 



2) INFORMATION FOR SEQ ID NO: 625 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625 
CGTTGAAGAC ACGACCCAAA GTATCC 26 

40 

2 ) INFORMATION FOR SEQ ID NO: 626 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 626 
55 TACCACCTTT TAAGTAAGGT GCTAAT 26 



60 



2 ) INFORMATION FOR SEQ ID NO: 627 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627 
ATTGTCTATA AAAATGGCGA TAAGTC 2 6 



10 



15 2) INFORMATION FOR SEQ ID NO: 628 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628 

AAAATGGCGA TAAGTCACAA AAAGTA 26 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 629 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
3 5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629 
AAGTTCCATC TCAACAAGGT CAATA 25 

2) INFORMATION FOR SEQ ID NO: 630 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630 
CGGAGCTATC CTAGTCGTTT CA 22 

60 
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60 



2 ) INFORMATION FOR SEQ ID NO: 631 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631 
15 CAGACCAACY GAIAARCCAT TRAGAT 2 6 



2) INFORMATION FOR SEQ ID NO: 632 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632 

30 

CCCTTTGGTG GRTCSTKCTT GGA 23 



3 5 2 ) INFORMATION FOR SEQ ID NO: 633 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

4 0 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633 

CAGACCAACY GAIAARCCIT TRAGAT 2 6 



2) INFORMATION FOR SEQ ID NO: 634 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634 

AACACYGTCA GRRCIATTGC YATGGA 



2) INFORMATION FOR SEQ ID NO: 635 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635 
AAACCRGTIA RRGCRACTCT IGCTCT 



2) INFORMATION FOR SEQ ID NO: 63 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636 
ACTGGYGTTG AIATGTTCCG YAA 



2) INFORMATION FOR SEQ ID NO: 637 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 7 
ACGTCAGT I G TACGGAARTA GAA 



2 ) INFORMATION FOR SEQ ID NO: 638 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638 

CCAATGCCAC AAACI CGTGA RCACAT 26 



10 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 639 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 9 
TTTACGGAAC ATTTCWACAC CWGTIACA 28 

2) INFORMATION FOR SEQ ID NO: 640 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 0 
TCCATGGT I T WYGG I CARAT GAA 23 

40 

2) INFORMATION FOR SEQ ID NO : 641 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641 
55 TGATAACCWA CIGCIGAIGG CATACG 26 



60 



2) INFORMATION FOR SEQ ID NO: 642 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642 
GGCGTIGGIG ARCGIACICG TGA 



2) INFORMATION FOR SEQ ID NO: 643 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643 
ACTGGIGTIG ARATGTTCCG YAA 



2) INFORMATION FOR SEQ ID NO: 644 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644 
ACGTCIGTIG TICKGAARTA GAA 



2) INFORMATION FOR SEQ ID NO: 645 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645 
ACGTCIGTIG TICKGAARTA RAA 
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2 ) INFORMATION FOR SEQ ID NO: 646 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646 
15 ATCGACAAGC CITTCYTIAT GSC 23 



2) INFORMATION FOR SEQ ID NO: 647 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647 

30 

ACGTCCGTSG TRCGGAAGTA GAACTG 26 



35 2) INFORMATION FOR SEQ ID NO: 648 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648 

ACGTCSGTSG TRCGGAAGTA GAACTG 26 



50 



60 



2) INFORMATION FOR SEQ ID NO: 649 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 9 

GTCCTATGCC TCARACWCGI GAGCAC 



2) INFORMATION FOR SEQ ID NO: 650 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650 
TTACGGAACA TYTCAACACC IGT 



2) INFORMATION FOR SEQ ID NO: 651 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651 
TGACGACCAC CI TCYTCYTT YTTCA 



2) INFORMATION FOR SEQ ID NO: 652 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 652 
CCWAYAGTIY KICCICCYTC YCTIATA 



2) INFORMATION FOR SEQ ID NO: 653 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653 
GAYTTCATIA ARAAYATGAT 



2 ) INFORMATION FOR SEQ ID NO: 654 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654 
TACAARATYK GIGGTATYGG 



2) INFORMATION FOR SEQ ID NO: 655 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655 
CCRATACC I C MRATYTTGTA 



2) INFORMATION FOR SEQ ID NO: 656 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656 
AATTAATGGC TGCAGTTGAY GA 



2) INFORMATION FOR SEQ ID NO: 657 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657 
TTGTC CACGT TCGATRTCTT CA 



2) INFORMATION FOR SEQ ID NO: 658 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658 
GATYTAGTCG ATGATGAAGA ATT 



2) INFORMATION FOR SEQ ID NO: 659 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659 
GCTTTTTGIG TTTCWGGTTT RAT 



2) INFORMATION FOR SEQ ID NO: 660 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 660 
GTAGAATTGA GGACGGTAGT TAG 
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2) INFORMATION FOR SEQ ID NO: 661 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661 
15 GTAGAAYTGT GGWCGATART TRT 23 



20 



2 ) INFORMATION FOR SEQ ID NO: 662 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: CoryneJbacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662 

35 CGGCGCAATC CTCGTTGTTG CTGCCACCGA CGGCCCAATG CCTCAGACCC 50. 

GTGAGCACGT TCTGCTCGCT CGCCAGGTCG GCGTTCCTTA CAT CCTCGTT 100 

GCTCTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

CGAGATGGAG ATCCRTGAGC TGCTCGCTGA GCAGGATTAC GACGAAGAGG 200 

CTCCAATCAT CCACATCTCC GCACTGAAGG CTCTTGAGGG CGACGAGAAG 250 

40 TGGACCCAGT CCATCATCGA CCTCATGCAG GCTTGCKATG ATTCCATCCC 3 00 

AGACCCAGAG CGTGAGACCG ACAAGCCATT CCTCATGCCT ATCGAGGACA 350 

TCTTCACCAT CACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT 40 0 

GGCTCCCTGA AGGTCAACGA GGACGTCGAG ATCATCGGTA TCCGCGAGAA 450 

KGCTACCACC ACCACCGTTA CCGGTATCGA GATGTTCCGT AAGCTTCTCG 500 

45 ACTACACCGA GGCTGGCGAC AACTGTGGTC TGCTTCTCCG TGGCGTTAAG 550 

CGCGAAGACG TTGAGCGTGG CCAGGTTGTT GTTAAGCGAG GCGCTTACAC 600 

CCCTCACACC GAGTTCGAGG GCTCTGTCTA CGTTCTGTCC AAGGACGAGG 650 

GTGGCCGCCA CACCCCATTC TTCGACAACT ACCGCCCACA GTTCTACTTC 700 

CGCACCACCG ACGTTACCGG TGTTGTGAAG CTTCCTGAGG GCACCGAGAT 75 0 

50 GGTCATGCCT GGCGACAACG TCGACATGTC CGTCACCCTG ATCCAGCCTG 800 

TCGCTATGGA TGAGGGCCTG CGCTTCGCTA TC 832 



55 2) INFORMATION FOR SEQ ID NO: 663 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663 



AACGGCGAGC AAGACTTGGT GTTGGAGGTT TCTCAGCACT TGGGTGAGAA 50 

CACCGTGCGT ACCATTGCCA TGGACGGTAC CGAGGGTTTG GTGAGAGGTA 100 

CCGCTGTCAC TGACACCGGT GCTCCCATCT CGGTCCCCGT TGGTCAGGGT 150 

ACCTTGGGCC GGATCATCAA CGTTGTCGGT GAGCCCATCG ACGAGCGTGG 2 00 

15 TCCCATCCAG TGCAAGCAGA GAAACCCCAT TCACGCCGAG CCCCCGTCTT 250 

TCACCGAGCA GTCCGTCGAG GCTGAGGTGT TGGAGACCGG TATCAAGGTT 3 00 

GTCGACTTGT TGGCTCCCTA CGC CCGTGGT GGTAAGATTG GTCTTTTCGG 350 

TGGTGCCGGT GTCGGTAAGA CCGTGTTCAT CCAGGAGTTG ATTAACAACA 4 00 

TTGCCAAGGC CCACGGTGGT TTCTCCGTGT TCACTGGTGT CGGTGAGCGT 450 

20 ACTCGTGAGG GTAACGACTT GTACCGTGAA ATGAAGGAGA CCGGTGTCAT 5 00 

CAACTTGGAG GGCGACTCCA AGGTGGCCTT GGTGTTCGGT CAGATGAACG 550 

AGCCCCCGGG GGCTCGTGCC CGTGTCGCCT TGACCGGTTT GACCATTGCC 600 

GAGTACTTCC GTGACGAGGA AGGCCAGGAC GTGTTGTTGT TCGTTGACAA 650 

CATTTTCAGA TTCACCCAGG CCGGTTCCGA GGTGTCGGCG TTGTTGGGTC 700 

25 GTATCCCCTC CGCCGTCGGT TACCAGCCCA CTTTGGCCAC CGACATGGGT 750 

TTGTTGCAGG AGAGAATTAC CACCACCAAG AAGGGTTCCG TCACCTCTGT 8 00 

GCAGGCCGTG TACGTCCCTG CCGATGACTT GACTGACCCT GCCCCCGCCA 850 

CCACTTTCGC TCACTTGGAC GCCACCACCG TGTTGTCGCG TGGTATCTCC 900 

GAGTTGGGTA TCTACCCCGC CGTCGACCCC TTGGACTCCA AGTCGAGATT 950 

30 GTTGGACGTC GAGGTTGTTG GCCAGGAGCA CTACGACGTC GCCACCGGTG 10 00 

TCCAGGAGTG CTTGCAGGCC TACAAGTCGT TGCAGGACAT CATTGCCATT 10 50 

TTGGGTATGG ACGAGTTGTC CGAGCAGGAC AAGTTGACCG TCGAGAGAGC 1100 

CCGTAAGATC CAGCGTTTCT TGTCGCAGCC CTTCGCTGTC GCCGAGGTTT 1150 

TCACTGGTAT CCCCGGTAGA TTGGTGAGAT TGCAGGACAC CG 1192 

35 

2) INFORMATION FOR SEQ ID NO: 664 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664 
50 AAYATGATIA CIGGIGCIGC ICARATGGA 29 



2) INFORMATION FOR SEQ ID NO: 665 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
60 (D) TOPOLOGY: Linear 

374 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from X00779 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665 

10 ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 
TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 2 50 

15 AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 4 00 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

2 0 AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 6 00 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAAC CAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

2 5 GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 85 0 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 
GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

30 GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 12 00 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

35 ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 13 00 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 



40 



50 



2) INFORMATION FOR SEQ ID NO: 666 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1536 bases 
45 (B) TYPE: Nucleic acid 

(C> STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from M12082 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666 

ATGGTTTTGC CAAGACTATA TACTGCTACA TCCCGTGCTG CTTTTAAAGC 50 

AGCCAAACAA TCCGCTCCGC TTCTATCCAC TTCGTGGAAA AGATGTATGG 100 

CCTCAGCTGC TCAATCTACT CCAATCACCG GTAAAGTTAC CGCTGTCATT 150 

60 GGTGCCATTG TTGACGTTCA TTTTGAACAA TCAGAGTTGC CCGCTATTTT 200 

375 



SUBSTITUTE SHEET (RULE 26) 
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GAACGCTTTA GAAATTAAAA CACCTCAAGG TAAGTTGGTT TTGGAAGTTG 250 

CTCAACATTT GGGTGAAAAC ACTGTCAGAA CCATTGCTAT GGATGGTACC 3 00 

GAAGGTTTGG TCCGTGGTGA AAAGGTTCTT GACACTGGTG GCCCTATCTC 3 50 

CGTCCCAGTT GGGAGAGAAA CTTTAGGGAG AATCATCAAC GTTATCGGTG 400 

5 AACCTATTGA TGAAAGAGGT CCAATTAAGT CCAAACTAAG AAAGCCAATT 45 0 

CACGCAGACC CTCCTAGTTT TGCAGAACAA TCTACTTCGG CTGAAATTTT 50 0 

GGAAACAGGT ATCAAAGTCG TCGATCTATT AGCTCCTTAT GC CAGAGGTG 550 

GTAAGATTGG TCTTTTCGGT GGTGCAGGTG TCGGTAAGAC TGTGTTCATT 60 0 

CAAGAATTGA TTAACAATAT CGCCAAGGCC CATGGTGGTT TTTC CGTTTT 650 

10 CGCCGGTGTT GGTGAAAGGA CCAGAGAGGG TAATGACTTG TACCGTGAAA 700 

TGAAGGAAAC TGGAGTCATT AACTTGGAAG GTGAATCCAA GGTCGCCTTA 750 

GTTTTCGGTC AAATGAACGA ACCTCCAGGA GCCAGAGCCA GAGTCGCTTT 80 0 

AACTGGTTTG ACGATCGCTG AATATTTCAG AGATGAAGAA GGTCAAGACG 850 

TCTTGTTGTT TATCGACAAT ATCTTTAGAT TTACTCAAGC TGGTTCAGAA 90 0 

15 GTCTCTGCCC TTTTGGGTCG TATTCCATCT GCCGTCGGTT ATCAACCAAC 950 

TTTGGCCACT GATATGGGTC TCTTACAAGA AAGAATTACC ACCACAAAGA 1000 

AGGGTTCTGT CACTTCTGTG CAAGCCGTTT ATGTTCCAGC CGATGATTTA 1050 

ACAGATCCGT CTCCGTCCAC ATCTTTTGCC CATTTGGACG CATCATCCGT 110 0 

CTTGTCAAGA GGTATTTCAG AATTAGGTAT TTACCCTGCA GTGGATCCAT 115 0 

2 0 TGGATTCTAA ATCAAGGTTA TTGGATGCCG CCGTTGTCGG TCAAGAACAT 12 00 

TATGACGTCG CCTCCAAGGT TCAAGAAACT TTACAGACCT ATAAATCTTT 1250 

ACAAGATATC ATTGCTATTT TGGGTATGGA TGAATTGTCC GAACAAGATA 1300 

AACTAACTGT CGAAAGGGCA AGAAAGATTC AAAGATTCTT ATCTCAACCA 13 50 

TTTGCTGTCG CCGAAGTCTT TACTGGTATC CCAGGTAAAT TAGTGAGATT 140 0 

25 AAAGGACACC GTTGCCTCGT TCAAAGCCGT TTTGGAAGGT AAATACGATA 1450 

ATATACCAGA ACATGCTTTC TATATGGTTG GTGGTATTGA AGATGTTGTT 1500 

CGTAAAGCTG AAAAGTTAGC CCGTGAAGCC AACTAG 153 6 



30 



40 



2 ) INFORMATION FOR SEQ ID NO: 667 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1293 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: Y 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667 

CGGAAGGCGT GCCGCCGGTG TTAACTGCTT TGGATGTGGT GGAAAAACTT 50 

GGCCGTGACG AGCCCTTGAC ACTTGAAATT GTACAGCACT TGGACGCCCA 100 

CACCGGCCGT TGCATTGCGA TGCAGACGAC AGATCTGCTT AAGCTGAAAT 150 

50 CGAAGGTAGT CTCGAGTGGC GGGAACATTT CTGTGCCTGT GGGTCGGGAG 2 00 

ACACTTGGCA GAATCTTCAA TGTGCTGGGC GATGCCATTG ACCAGCGCGG 2 50 

CCATGTTGGA GAGAAGCAAC GCATGCCTAT TCACGCTGTG GCACCAAAGT 300 

TGGCGGATCA GGCGGCAGAG GACACCATTC TCACAACGGG TATCAAGGTG 350 

ATTGATCTGA TTTTGCCCTA CTGCAAGGGT GGGAAGATTG GTCTTTTTGG 4 00 

55 TGGTGCTGGC GTGGGCAAGA CGGTTATTAT CATGGAACTT ATTAACAACG 450 

TTGCCAAGGG CCATGGTGGG TTCTCCGTCT TTGCGGGTGT TGGTGAACGC 500 

ACGCGTGAGG GGACGGATCT TTACCTGGAG ATGATGCAAT CCAAAGTTAT 550 

TGACCTGAAG GGTGACTCGA AGTGTGTGTT GGTGTATGGT CAGATGAACG 600 

AGCCTCCGGG TGCCCGTGCT CGTGTGGCGC AGTCTGCCTT GACAATGGCC 650 

60 GAGTACTTCC GTGATGTGGA AGGGCAAGAC GTGTTGCTTT TTATCGACAA 700 

376 
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CATTTTCCGC TTTACGCAGG CAAACTCTGA GGTGTCAGCG CTG*TTGGGTC TJ50 

GTATTCCCGC TGCCGTCGGC TACCAGCCTA CCCTTGCTGA GGATCTTGGG 'SOO 

CAGTTGCAGG AGCGCATTAC GTCCACGACA AAAGGTTCCA TTACCTCTGT 850 

GCAGGCTGTG TACGTGCCAG CCGATGATAT TACCGACCCT GCGCCAGCAA 900 

5 CGACCTTTTC CCACCTCGAT GCCACGACGG TGCTGGACCG TGCCGTTGCC 950 

GAATCCGGCA TTTACCCCGC TGTCAACCCA CTGGAGTGTG CCTCGCGTAT 100 0 

CATGGACCCG GATGTAATCA GCGTTGACCA CTACAACGTG GCGCAGGATG 1050 

TGGTGCAGAT GCTTACCAAG TACAAGGAGC TGCAGGATAT CATTGCGGTG 110 0 

CTTGGCATTG ATGAGCTCAG TGAGGAGGAT AAACTTATCG TGGACCGTGC 1150 

10 GCGTAAGGTG ACAAAGTTTC TCTCCCAGCC TTTCCAGGTG GCGGAGGTGT 12 00 

TTACTGGCAT GACAGGCCAC TACGTGCAGC TGGAGGAGAC CATTGAGTCT 12 50 

TTTTCCGGCC TGTTGATGGG CACATATGAT CAGGTGCCGG AGA 12 93 



15 



25 



2) INFORMATION FOR SEQ ID NO : 668 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1191 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium glutamicum 
(C) ACCESSION NUMBER: X77034 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 668 

GTGGCAAAGG CGAAGTTCGA GCGTACCAAG CCCCACGTAA ACATCGGCAC 50 

CATCGGTCAC GTTGACCACG GTAAGACCAC CACCACCGCG GCTATCACCA 100 

AGGTTCTGGC TGACACTTAC CCTGAGCTCA ACGAGGCTTT CGCCTTCGAC 150 

3 5 TCCATCGATA AGGCTCCTGA GGAGAAGGAG CGTGGCATCA CGATCAACAT 2 00 

CTCCCACGTT GAGTACCAGA CTGAAAAGCG CCACTACGCA CACGTTGACG 250 

CTCCAGGCCA CGCCGACTAC ATCAAGAACA TGATTACCGG CGCTGCTCAG 3 00 

ATGGACGGCG CAATCCTCGT TGTTGCTGCT ACCGACGGCC CAATGCCTCA 350 

GACCCGTGAG CACGTTCTTC TTGCTCGCCA GGTTGGCGTT CCTTACATCC 4 00 

4 0 TCGTTGCTCT TAACAAGTGC GACATGGTTG AGGATGAGGA AATCATCGAG .450 

CTCGTCGAGA TGGAAGTTCG TGAACTTCTT GCTGAGCAGG ACTACGACGA 500 

AGAGGCTCCA ATTGTTCACA TCTCCGCTCT GAAGGCTCTT GAGGGCGACG 550 

AGAAGTGGGG CAAGCAGATC CTTGAGCTCA TGCAGGCTTG CGATGACAAC 600 

ATCCCTGACC CAGTTCGTGA GACCGACAAG CCATTCCTCA TGCCTATCGA 650 

45 GGACATCTTC ACCATCACCG GTCGTGGCAC CGTTGTTACC GGTCGTGTTG 700 

AGCGCGGTAC CCTGAACGTG AACGATGATG TTGACATCAT CGGCATCAAG 750 

GAGAAGTCCA CCTCCACCAC CGTTACCGGT ATCGAGATGT TCCGTAAGCT 8 00 

TCTTGACTCC GCTGAGGCTG GCGACAACTG TGGTCTGCTT CTCCGTGGTA 850 

TCAAGCGCGA AGATGTTGAG CGTGGCCAGG TTATCGTTAA GCCAGGCGCT 900 

50 TACACCCCTC ACACCGAGTT CGAGGGCTCT GTCTACGTTC TGTCCAAGGA 950 

TGAAGGTGGC CGCCACACCC CATTCTTCGA CAACTACCGT CCTCAGTTCT 1000 

ACTTCCGCAC CACCGACGTT ACCGGTGTTG TGAAGCTTCC AGAGGGCACC 1050 

GAGATGGTCA TGCCTGGCGA CAACGTCGAC ATGTCCGTCA CCCTGATCCA 1100 

GCCTGTCGCT ATGGACGAGG GCCTGCGTTT CGCTATCCGC GAAGGCTCCC 1150 

55 GCACCGTTGG CGCTGGTCGT GTCACCAAGA TCATCAAGTA A 1191 



60 



2) INFORMATION FOR SEQ ID NO: 669 

377 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from V00267 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669 

15 ATGGCTACTG GAAAGATTGT CCAGGTAATC GGCGCCGTAG TTGACGTCGA 50 

ATTCCCTCAG GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA 100 

ATGGTAATGA GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT 150 

ATCGTACGTA CCATCGCAAT GGGTTCCTCC GACGGTCTGC GTCGCGGTCT 200 

GGATGTAAAA GACCTCGAAC ACCCGATTGA AGTCCCGGTA GGTAAAGCGA 250 

20 CTCTGGGCCG TATCATGAAC GTACTGGGTG AACCGGTCGA CATGAAAGGC 3 00 

GAGATCGGTG AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA 3 50 

CGAAGAGCTG TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA 400 

TCGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 450 

GGTGCGGGTG TAGGTAAAAC CGTAAACATG ATGGAGCTCA TTCGTAACAT 500 

25 CGCGATCGAG CACTCCGGTT ACTCTGTGTT TGCGGGCGTA GGTGAACGTA 550 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC 600 

GACAAAGTAT CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG 650 

TCTGCGCGTT GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG 700 

AAGGTCGTGA CGTTCTGCTG TTCGTTGACA ACATCTATCG TTACACCCTG 750 

30 GCCGGTACGG AAGTATCCGC ACTGCTGGGC CGTATGCCTT CAGCGGTAGG 800 

TTATCAGCCG ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA 850 

CCTCCACCAA AACTGGTTCT ATCACCTCCG TACAGGCAGT ATACGTACCT 900 
GCGGATGACT TGACTGACCC GTCTCCGGCA ACCACCTTTG CGCACCTTGA 950 

CGCAACCGTG GTACTGAGCC GTCAGATCGC GTCTCTGGGT ATCTACCCGG 10 00 

35 CCGTTGACCC GCTGGACTCC ACCAGCCGTC AGCTGGACCC GCTGGTGGTT 1050 

GGTCAGGAAC ACTACGACAC CGCGCGTGGC GTTCAGTCCA TCCTGCAACG 1100 

TTATCAGGAA CTGAAAGACA TCATCGCCAT CCTGGGTATG GATGAACTGT 1150 

CTGAAGAAGA CAAACTGGTG GTAGCGCGTG CTCGTAAGAT CCAGCGCTTC 1200 

CTGTCCCAGC CGTTCTTCGT GGCAGAAGTA TTCACCGGTT CTCCGGGTAA 1250 

40 ATACGTCTCC CTGAAAGACA CCATCCGTGG CTTTAAAGGC ATCATGGAAG 1300 

GCGAATACGA TCACCTGCCG GAGCAGGCGT TCTACATGGT CGGTTCCATC 1350 

GAAGAAGCTG TGGAAAAAGC CAAAAAACTT TAA 13 83 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 670 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1410 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(B) STRAIN: NCTC 11638 

(C) ACCESSION NUMBER: extracted from AF004014 

378 



SUBSTITUTE SHEET (RULE 26) 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670 



PCT/CA00/01150 



ATGAAAGCGA 
TGTGGAGTTT 
5 ATTTTGAAGT 
CATTTGGGTG 
CTTAGTGCGT 
CTGTGGGCGA 
ATTGATAATT 

10 AAAAGC CCCT 
CCGGTATTAA 
GTAGGCTTGT 
GCTTATCCAC 
GTGTGGGGGA 

15 GAAGGGGGCG 
GCCACCAGGT 
AGTATTTCCG 
ATCTTTAGAT 
TATCCCTTCA 

20 AACTTCAAGA 
CAAGCGGTGT 
GGTGTTTGCG 
AAAAAGGGAT 
TTAAGCCCTC 

25 CCAGCAAGTT 
TGGGATTGGA 
AGAAAAATTG 
TACAGGAAGT 
TTGGAGGGAT 

30 TACATGGTGG 
AAATTCCTAA 



TGGAAGGTAA 
GAATCCTATC 
TAATGGCGTT 
GTAATCGGGT 
AACCAAGCCG 
AGAAGTGTTA 
TAGAGCCGCT 
AGTTTTGAGC 
AGTCATTGAC 
TTGGTGGGGC 
AATGTGGCTT 
GCGCACCAGA 
TTTTAGACAA 
GCAAGGAATC 
TGATGAAAAG 
ACGCTCAAAG 
GCGGTGGGGT 
GCGTATCGCT 
ATGTGCCAGC 
CATTTAGATG 
TTATCCGGCG 
AAATGATTGG 
TTGCAAAAAT 
CGAATTGAGC 
AGAAGTTTTT 
CCCGGTAAGT 
TTTAGAGGGC 
GCAGCATTCA 



AATCATTCAG 
TGCCGGCGAT 
CAAAAATCTT 
GCGAGCGATT 
TCAAAGCTCG 
GGGCGTATTT 
TAAGCCGTCC 
AGCAAAGCAC 
TTGCTCGCGC 
TGGCGTAGGC 
ATAAGCATAA 
GAAGGGAACG 
AGTTGCGTTG 
GCATCGCATT 
GGCTTAGATG 
CGGTGCGGAA 
ATCAGCCCAC 
TCCACTAAAA 
AGACGACTTG 
CGACTACGGT 
GTGGATCCTT 
CGAGAAGCAC 
ACAAGGATTT 
GAAGAGGATA 
ATCCCAGCCG 
ATGTGACTCT 
AAATACGATC 
AGAGGTTTTA 



GTTTTAGGCC 
TTTTGAAGCA 
TAGTTTTAGA 
GCTATGGATA 
TGGCAAAATG 
TTAATGTTGT 
TTAACTTGGC 
TAAAACAGAA 
CTTATTCTAA 
AAAACGGTGA 
CGGGTATTCG 
ATCTGTATTT 
TGCTATGGGC 
CACCGGCTTG 
TGTTGATGTT 
ATGAGCGCGC 
GCTAGCCGGG 
ATGGCTCTAT 
ACTGACCCAG 
GTTGAATAGA 
TGGATTCCAC 
TATGAAATCG 
GCAAGATATT 
AAAAAACGGT 
TTTTTTGTGG 
CCAAGAGACT 
ACATTCCTGA 
GAAAAAGCTA 



CGGTGGTAGA 50 

CTAGACATTA 100 

GGTGGCAGCC 150 

TGACAGAAGG 200 

ATTGAAGTGC 250 

GGGCGAGAGC 300 

CCATTCACAG 350 

ATGTTTGAAA. 400 

GGGCGGTAAA 450 

TCATTATGGA 500 

GTGTTTGCAG 550 

TGAGATGAAA 600 

AAATGAATGA 650 

ACGATGGCGG 700 

TATTGATAAC 750 

TATTAGGCCG 800 

GAAATGGGGA 850 

CACTTCGGTT 900 

CCCCTGCTTC 950 

AAGATCGCTG 1000 

TTCAAGGATT 1050 

CCACCGGTAT 1100 

ATTGCGATTT 1150 

TGAAAGGGCC 1200 

CTGAAGTGTT 1250 

TTAGAGGGCT 1300 

AAACGCGTTT 1350 

AAAACATGAA 1400 
1410 



35 2) INFORMATION FOR SEQ ID NO : 671 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 01 bases 

(B) TYPE: Nucleic acid 

4 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium acetobutylicum 

(B) STRAIN: DSM 792 

(C) ACCESSION NUMBER: extracted from AF101055 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671 

ATGCCAGAAC ATGTAGGTAA AATTGTTCAG GTAATAGGAC CTGTTGTGGA 50 

TATTAAATTT GATGCAGAGA ACCTTCCTGA CATCTATAAT TCCATAGAAA 100 

TAGATATGGG AGATAATAAA AAACTCATTG CTGAAGTTGA ACAACATGTA 150 

55 GGAGATGACA TAGTAAGAAC AATAGCAA.TG GAAGGTACTG ACGGATTAAA 200 

AAGAGGAATG GAAGCAGTTA ACACTGGTAA ACCAATATCT GTACCAGTTG 250 

GAGAAAATGT TTTAGGACGT CTTTTTAATG TTTTAGGTCA GACAATAGAT 300 

GAAGCAGGAG ACATGAATGC TGATAAGTAT TATCCAATTC ATAGACCAGC 350 

TCCAACCTTT GAAGAACAAT CAGTTCAACC AGAAATGTTT GAAACAGGTA 400 

60 TTAAGGTTAT AGATTTACTT GCTCCATATC AAAAGGGTGG AAAAATCGGT 450 

379 
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TTGTTGGGTG GTGCCGGTGT TGGTAAAACA 
AAATAATATA GCAAAAGAAC ACGGTGGATT 
GAGAAAGAAC AAGAGAAGGT AATGACCTTT 
GGAGTTATAA ATAAAACAGC TCTAGTATTT 
5 TGGCGCTAGA ATGAGAGTTG CTTTAACAGG 
TTAGAGACAA AGGTCAAGAT GTGCTTCTAT 
TTTACACAAG CTGGTTCAGA GGTTTCAGCT 
TGCCGTTGGT TATCAGCCAA CACTTGCAAA 
AGAGAATAAC ATCAACAAAA CAGGGTTCAA 

10 TATGTTCCTG CTGATGACCT TACAGACCCA 
GCATCTTGAT GCAACAACAG TTCTTTCAAG 
TATATCCTGC AGTTAGTCCT CTTGAATCAA 
AGAATTGTTG GAGAAGAGCA TTATGAAGTT 
ACTTGAAAGA TATCAAGAAC TTCAAGATAT 

15 ATGAACTTTC AGATGAGGAT AGATTGTTAG 
CAGAGATTCT TATCTCAAGC TTTTAGTGTT 
GAAAGGTCAG TTTGTACCTG TAAAAGATAC 
TATTAGATGG TAAGTGTGAT GATCTTCCAG 
GGAACAATAG AAGATG TAAA AGAAAAAGCT 

20 A 



PCT/CA00/01150 

GTTCTTATTC AGO£ACTTAT ioO 

ATCAGTATTC ACAGGTGTTG 550 

ATTATGAAAT GAAAGATTCA 600 

GGTCAGATGA ATGAACCACC 650 

ACTTACAATG GCTGAATATT 70 0 

TTATAGATAA TATATTCAGA 750 

TTACTTGGTA GAATACCTAG 8 00 

TGAAATGGGT GCTCTTCAAG 850 

TCACATCCGT TCAGGCTGTA 900 

GCTCCAGCAA CAACATTTAC 950 

AGAAATATCA AACTTAGGAA 1000 

CTTCAAGAAT ACTTGATCCA 1050 

GCTAACAAGG TTAAACATAT 1100 

CATAGCTATA CTTGGTGTTG 1150 

TTGGAAGAGC AAGAAGAGTA 12 00 

GCTGAACAAT TTACAGGAAT 1250 

TATAAGAAGT TTTAAAGAAA 13 00 

AAGCTGCATT TTTATTTGCA 1350 

AAAAAAATGA TGGAAAGCTA 1400 

1401 



2) INFORMATION FOR SEQ ID NO: 672 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cytophaga lytica 

(B) STRAIN: DSM 2 03 9 

(C) ACCESSION NUMBER: M2253 5 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672 



ATGTCTAAAG TTACAGGTAA AGTTTCCCAA ATTATTGGCC CAGTTATAGA 50 

TGTGGAGTTC CAAGCAGGGG TAGATCTTCC AAAAATTTAT GATTCATTAG 100 

AAATTAAAAA AGCAGATGGA TCAATTTTGG TTTTGGAAGT ACAATCACAC 150 

ATTGGTGAGA ACACAGTAAG AACTATATCT ATGGATTCTT CTGATGGTTT 2 00 

45 AAGTCGTGGA GCAGAGGTTA ATGCAACAGG AAGCGCTATA CAAATGCCAG 2 50 

TTGGAGATGA CGTTTACGGA CGTTTATTTA ACGTAATTGG AGACGCTATT 300 

GATGGTCTTG GGAATTTACC AAAATCTGGT AAAGATGGTT TGCCAATACA 3 50 

CAGAGAGGCA CCAAAATTTG AAGACTTATC TACTTCTACA GAAGTATTAT 4 00 

TTACAGGTAT TAAAGTAATT GACCTTATTG AGCCTTATGC AAAAGGTGGT 450 

50 AAGATTGGTT TATTTGGAGG TGCCGGAGTA GGTAAAACAG TATTAATTCA 500 

GGAATTAATT AACAACATTG CAAAAGGTCA CGGTGGACTT TCTGTATTTG 550 

CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ACGATTTACT ACGTGAGATG 600 

TTAGAGTCTG GTATTATTAA ATACGGAGAT GACTTTATGC ACTCTATGGA 650 

AGAAGGTGGT TGGGATTTAT CTAAAGTTGA TAAATCTGTA ATGAAAGATT 700 

55 CTAAAGCAAC CTTTGTATTT GGACAAATGA ATGAGCCACC AGGAGCACGT 750 

GCACGTGTTG CATTATCTGG TTTAACTATT GCAGAATATT TCCGTGATGG 800 

AGCAGGTGAA GGTCAAGGTA AAGATGTACT TTTCTTTGTG GATAACATTT 850 

TCCGTTTTAC ACAAGCTGGT TCTGAGGTAT CTGCATTACT TGGTCGTATG 900 

CCATCTGCGG TAGGTTACCA ACCAACATTA GCAACAGAAA TGGGTGCTAT 950 

60 GCAAGAGCGT ATTACATCAA CAAAAAGAGG TTCTATTACA TCTGTACAGG 1000 

380 



SUBSTITUTE SHEET (RULE 26) 
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CGGTTTACGT ACCTGCGGAT GATTTAACGG 
TTTGCTCACT TAGATGCAAC AACGGTATTG 
AGGTATTTAC CCAGCGGTAG ATC CATTAG A 
CTCCAGAAAT TTTAGGAAAA GATCACTACT 
5 GAGTTGTTAC AACGTTATAA AGAATTACAA 
TATGGAAGAA TTATCTGAGG AAGATAAAAT 
GTGTACAACG TTTCTTATCT CAGCCTTTCC 
GGTCTTAAAG GTGTTTTAGT AGATATCAAG 
TATGATTATG GATGGTGAAT TAGATCACTT 
10 TTAAAGGTAC TATTGAAGAA GCTATAGAAG 
GAAGCATAA 



PCT/CA00/01150 

ATCCAGCACC GGCA&CTACC 

TCTCGTAAAA TTGCAGAGTT 1100 

TTCTACTTCT AGAATCTTAG 1150 

CTTGTGCACA ACGTGTAAAA 12 00 

GATATTATTG CTATCCTTGG 12 50 

GGCAGTTGGT AGAGCAAGAC 13 00 

ACGTAGCAGA GCAATTTACA 13 5 0 

GATACTATTA AAGGATTTAA 14 0 0 

ACCAGAATCT GCATTTAACC 14 5 0 

CAGGAGAAAA AATGCTTGCT 1500 

1509 



15 2) INFORMATION FOR SEQ ID NO: 673 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia risticix 

(B) STRAIN: HRC-IL 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673 



CCTAAAATAT ATGATGCATT ATATGTAAAA CTAGATAATG AAAATTTGTG 50 

TTTAGAAGTT TCACAAATTA TTGGAGATAA TGTTGTTAGA TGTATTGCAA 100 

TGGGAGCTAC TTATGGATTA AATCGTGGTT TAGAAGTAGT TTGTTCAGGA 150 

AATCCAATTC AGGTTCCTGT AGGTGAACAA GTTTTAGGTA GAATGTTTAA 200 

35 TGTTGTTGGT AAAACAATTG ACAATCTTGA ATCTTTAGAT GATAAAAATA 250 

TAAAAATGAT GCCAATTCAT CGAAATCCAC CATCATTTGA AGAGCAATCC 300 

AATGAAATTG AAATTTTTGA AACAGGCATT AAAGTTATTG ATTTATTAAT 350 

TCCATATGCT AAAGGTGGTA AGATTGGATT ATTTGGTGGA GCAGGGGTTG 4 00 

GGAAAACGGT TCTTGTTCAA GAATTAATTC ACAATATCGC AAAAGGTCAT 450 

40 GGTGGTCTAT CTGTTTTTGC TGGAGTTGGT GAAAGAACTC GTGAAGGTAA 500 

TGACTTGTAT TATGAAATGA TTGAAGGTGG AGTTATAGAT AAAACAGCCT 550 

TAGTGTTTGG GCAAATGAAT GAACCTCCTG GCGCAAGAAT GCGCGTAGCA 600 

TTAACTGCTT TAACAATGGC TGAATATTTC CGTGATGTTC AAAACCAAGA 650 

TGTTTTGTTA TTTATTGATA ATATCTTTAG ATTTACACAA GCTGGTAGTG 70 0 

45 AAGTTTCAGC ATTATTAGGA AGAATGCCAT CTGCTGTTGG TTATCAACCA 750 

ACTTTGGCAT ATGAAATGGG ATTGTTACAA GAAAGAATCA CTTCCACTAA 80 0 

AAGTGGTTCT ATAACATCT 819 



50 



60 



2) INFORMATION FOR SEQ ID NO: 674 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 840 bases 
55 (B) TYPE: Nucleic acid 

<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

381 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674 

AGAGCGAAGT ACCAAGTGTT TACGATGCTC TGAATGTTGT GGATTCCAAA 50. 

GAACGTCTGG TTCTGGAAGT TCAACAGCAG CTAGGCGGTG GCGTGATTCG 100 

CGCTATCGTT ATGGGTAGCT CGGATGGTTT ACGTCGTGGA ATGACAGTAC 150 

10 AAAACACTGG CGCTCCAATT TCAGTACCAG TAGGTACTAA AACCCTAGGT 2 00 

CGTATCATGA ACGTGCTTGG TGATGCGATT GACGAACGTG GCGACATTGG 2 50 

CGCAGAAGAG GTGTACTCGA TTCACCGTCC TGCTCCAAGC TACGAAGAAC 300 

AGTCTAGTGC AACTGAACTT TTGGAAACGG GTGTTAAGGT TATCGACCTG 350 

ATCTGTCCGT TTGCGAAAGG CGGTAAAATC GGTCTGTTCG GTGGTGCGGG 400 

15- TGTAGGTAAG ACCGTTAACA TGATGGAACT TATCAACAAC ATCGCGCTAC 450 

AGCACTCAGG TTTGTCAGTA TTTGCTGGGG TAGGTGAGCG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGCAGGAA GCGGGCGTTG TAAACGTTGA 550 

ACAACCAGAA CTGTCGAAAG TAGCGATGGT TTACGGTCAG ATGAACGAGC 600 

CACCAGGCAA CCGTCTGCGT GTAGCACTGA CTGGTCTGAC TATGGCGGAA 650 

20 AAGTTCCGTG ATGAAGGCCG TGACGTACTG CTGTTTATCG ACAACATCTA 700 

CCGTTACACC CTAGCGGGAA CGGAAGTATC TGCTCTGCTT GGCCGTATGC 750 

CTTCAGCGGT AGGTTACCAA CCAACACTGG CTGAAGAGAT GGGTGTTCTG 8 00 

CAAGAACGTA TCACGTCAAC CAAAAAAGGT TCTATCACCT 84 0 



25 



40 



60. 



2) INFORMATION FOR SEQ ID NO: 675 



( i ) SEQUENCE CHARACTERI STICS : 
30 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 258 70 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675 



CGGCGGTATC CTAGTTGTAG CGGCAACTGA CGGTCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGCTGGGT CGCCAAGTAG GTATTCCTTA CATCATCGTA 100 

45 TTCATGAACA AGTGTGACAT GGTTGACGAT GAAGAGCTTC TAGAGCTGGT 150 

AGAGATGGAA GTTCGTGAGC TGCTGTCTGA GTACGATTTC CCAGGTGATG 2 00 

ACCTGCCAGT AATCCAAGGT TCAGCACTAG GCGCGCTAAA CGGCGAAGCA 250 

CAGTGGGAAG CGAAGATTGT TGAGCTAGCA GAAGCACTGG ATACTTAYAT 3 00 

TCCAGAGCCA GAGCGTGCAG TAGACATGGC ATTCCTGATG CCAATCGAAG 3 50 

50 ACGTATTCTC AATCCAAGGT CGTGGTACAG TAGTAACTGG CCGTATCGAG 4 00 

CGCGGCATCC TGAAAGTGGG TGACGAAGTA GCGATCGTTG GTATCAAAGA 4 50 

GACAGTAAAA ACGACCTGTA CAGGTGTAGA GATGTTCCGT AAGCTGCTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG CACTRCTACG TGGTACTAAG 550 

CGTGAAGAAG TAGAGCGTGG TCAAGTACTG GCGAAGCCAG GTTCAATCAC 600 

55 ACCACACACT AAGTTCGAAT CAGAAGTATA CGTACTGTCA AAAGATGAAG 650 

GTGGCCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG CAGCATTGAG CTACCAGARG GCGTAGAAAT 750 

GGTAATGCCA GGCGACAACG TGAAGATGGT TGTAGACCTG ATTGCACCAA 800 

TCGCGATGGA CGAAGGTCTA CGCTTCGC 828 



382 
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2) INFORMATION FOR SEQ ID NO: 676 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 5012 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676 



CTTCTCGGAT 
2 0 AACTCGGGCG 
GCGCACACCG 
GAAGTCGAAG 
GCGAGACGCT 
CGCGGCCCCG 

2 5 GAAGCTGGCG 

AAGTGATCGA 
TTCGGCGGTG 
CAACGTCGCG 
AGCGCACCCG 

3 0 GTGATCGACC 

GAACGAGCCC 
TGGCCGAGTA 
GACAACATCT 
GGGCCGCATC 
35 TCGGCATGTT 
TCTGTGCAGG 
GGCGACGACA 
TGGCGGAGTC 
CGTATCATGG 

4 0 GGACATTGTT 

CGGTGCTTGG 
CGTGCGCGCA 
GGTGTTTACT 
AGTCGTTCTC 

45 



GGCGTGCCGC 
CGACGAGCCG 
GCCGCTGCAT 
GTCGTGTCGA 
TGGCCGCATC 
TGGGCGAGAA 
GACCAGGCGG 
TCTGATCCTG 
CCGGTGTGGG 
AAGGGCCACG 
CGAGGGGACG 
TGAAAGGTGA 
CCGGGTGCGC 
CTTCCGCGAC 
TCCGCTTCAC 
CCCGCCGCTG 
GCAGGAGCGC 
CGGTGTACGT 
TTCTCGCACC 
TGGCATCTAC 
ACCCTGATGT 
CAGATGCTGA 
TATCGACGAG 
AGGTGACCCG 
GGCATGACAG 
CGGTTTGCTG 



CGGTGCTGAC 
CTGACGCTAG 
TGCGATGCAG 
CCGGCGGGAA 
TTCAACGTAC 
GATGCGCATG 
CGGAGGACAC 
CCGTACTGCA 
GAAGACTGTG 
GTGGCTTCTC 
GATCTGTACC 
GTCGAAGTGT 
GCGCGCGCGT 
GTGGAGGGCC 
GCAGGCGAAC 
TGGGCTACCA 
ATCACGTCGA 
GCCTGCGGAT 
TGGACGCGAC 
CCCGCGGTCA 
GATCGACGTG 
CCAAGTACAA 
CTGAGCGAGG 
GTTCCTGTCA 
GCCATTACGT 
ATGGGGTCGT 



GGCGCTGGAC 
AGATCGTGCA 
ACGACGGACC 
CATCTCTGTG 
TGGGCGACGC 
GCGATCCACG 
GATCCTGACG 
AGGGCGGCAA 
ATCATCATGG 
CGTGTTCGCC 
TGGAGATGAT 
GTGCTTGTGT 
TGCGCAGTCT 
AGAACGTGCT 
TCCGAGGTGT 
ACCGACGCTT 
CGACGAAGGG 
GATATCACGG 
GACGGTGCTG 
ACCCACTGGA 
GATCACTACA 
GGAGCTGCAG 
AGGACAAGGT 
CAGCCGTTCC 
ACAGCTGAGC 
ACGACCAGAT 



GTGACGGAGG 


50 


ACACTTGGAC 


100 


TACTGAAGCT 


150 


CCTGTGGGCC 


200 


GATTGACCAG 


250 


CGGAGGCGCC 


300 


ACCGGCATCA 


350 


GATCGGTCTG 


400 


AGCTGATCAA 


450 


GGCGTTGGGG 


500 


GCAATCGAAG 


550 


ACGGGCAGAT 


600 


GCGCTGACGA 


650 


GCTGTTCATC 


700 


CAGCGCTGCT 


750 


GCGGAGGATC 


800 


GTCGAT CACG 


850 


ATCCCGCGCC 


900 


GACCGCGCGG 


950 


GTGCGCGTCG 


1000 


ACGTTGCGCA 


1050 


GATATCATTG 


1100 


TGTGGTGGAC 


1150 


AGGTTGCGGA 


1200 


GACACGGTGG 


1250 


TCCGGAGA 


1298 



2) INFORMATION FOR SEQ ID NO : 677 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bahesia microtti 
60 (B) STRAIN: Persing-1 

383 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 677 

CAAGCTCAAG TCTGAGCGTG AGAGAGGTAT TACTATTGAC ATTACTCTCT 50 

5 GGAAATTTGA GACCCAGAAA TACGAGTACA CTGTCATAGA CGCACCTGGT 10 0 

CATCGTGACT TTATCAAAAA TATGATTACT GGGACTTCAC AAGCCGACGT 150 

TGCTATGCTT GTCGTTCCTG CTGAATCTGG CGGATTCGAG GCTGCTTTTT 2 00 

CCAAAGAAGG TCAGACCCGT GAACACGCCT TACTAGCCTT CACACTTGGC 2 50 

GTCAAACAGA TGATTGTTGC TATTAACAAA ATGGATTCTT GTCAGTACAA 3 00 

10 GGAGGATCGT TATATGGAAA TTTTCAAGGA AGTACAGCAG TACTTGAAGA 350 

AGGTGGGTTA CAAAGTTGAA AGCGTGCCGT TTGTTGCTAT TTCAGGATTC 4 00 

CACGGTGACA ACATGGTTGA AAAATCTACT AACATGCCTT GGTATAAGGG 450 

TAAGACCCTC GTAGAGGCAC TTGATCAAAT GGAGCCTCCA AAACGTCCGG 500 

TCGAAAAACC TCTTAGATTG CCCCTGCAGT CAGTCTATAA AATTGGAGGT 550 

15 ATTGGTACGG TACCAGTCGG AAGGGTCGAA ACAGGACAAC TGAAAGCAGG 600 

AATGATCATT ACTTTTGCCC CCACTGGTTT GACCACTGAA TGTAAATCTG 650 

TTGAAATGCA TCACGAGGTT GTGGAAGTGG CTAGCCCCGG TGATAACGTT 700 

GGATTTAATG TCAAGAATGT GTCTGTTAAG GATATTAAGA GAGGAAATGT 750 

GGCTTCGGAT TCGAAAAATG ACCCAGCCAA GGAAGCTACC TCTTTCTCTG 800 

20 CACAAGTCAT TGTACTCAAT CACCCTGGTA CCATCAAGGC CGGTTACTCA 850 

CCTGTGGTTG ATTGCCATAC TGCCCACATT GCTTGCAAAT TCGAATCTCT 90 0 

AGACACTAGG ATTGACAAGC GTACTGGCAA GACTTTGGAA GAAAATCCTA 950 

AGACTATTAA GAATGGTGAC GCTGCCATGG TGACTATGAA ACCAAATAAA 10 0 0 

CCCATGGTTG TGGAAACTTT CACCGACTAC GCCCCGTTGG GCCGGTTCGC 1050 

25 CGTGCGTGAT ATGCGC CAAA CCGTTGCCGT CGG 1083 



30 



2) INFORMATION FOR SEQ ID NO: 678 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE; Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN; Lev- 12 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 678 

45 TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCT CACGGTGGTT 50 

ACTCTGTCTT CACTGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTCTT CAGGCTAATT AGTCGATGAC 2 00 

GTGGGCCCTG ACTAAAACTG TTTCTTCCAG TCTTCGGTCA GATGAACGAG 2 50 

50 CCCCCTGGAG CCCGTGCCCG AGTTGCCCTT ACTGGTTTGA CCATTGCCGA 3 00 

GTACTTCCGT GACGAGGAAG GCCAGGATGT GTTGCTTTTC ATTGACAACA 3 50 

TTTTCCGATT CACCCAGGCC GGTTCCGAGG TGTCTGCCTT GCTCGGTCGT 4 00 

ATCCCCTCTG CCGTCGGTTA CCAGCCCACT CTTTCCACCG ACATGGGTGG 4 50 

TATGCAGGAG CGAATGTAGG TTGCATTCTC TGTGATTTTA CGGCAAGCCT 500 

55 TGACTTTTTT TTTCTAGTAC CACCACCAAG AAGGGTTCCA TTACCTCCGT 550 

C 551 



60 2) INFORMATION FOR SEQ ID NO: 679 

384 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 552 bases 

(B) TYPp: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformsins 

(B) STRAIN: ATCC 44104 



15 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 9 



TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACCGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 10 0 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTGGG AGCTAATTAG TCGATGACGT 200 

2 0 GGGCCCTGAC CAAAACCGTT TCTTTCAGTC TTCGGTCAGA TGAACGAGCC 250 

CCCTGGAGCC CGTGCCCGAG TTGCTCTTAC TGGTTTGACC ATTGCCGAGT 300 

ACTTCCGTGA CGAGGAAGGC CAGGATGTGT TGCTTTTCAT TGACAACATT 350 

TTCCGATTCA CCCAGGCCGG TTCCGAGGTG TCTGCCTTGC TCGGTCGTAT 4 00 

CCCCTCTGCC GTCGGTTACC AGCCCACTCT TTCCACCGAT ATGGGTGGTA 450 

25 TGCAAGAGCG AATGTAAGTT GCATTTTTTG TGATTTTACG GCAAGTCTTG 500 

ACTTACATTT TTTTCTAGCA CCACCACCAA GAAGGGTTCC ATTACCTCCG 550 

TC 552 



2) INFORMATION FOR SEQ ID NO: 680 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 1018 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella. JbejrtJioIIetiae 

(B) STRAIN: ATCC 42115 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680 

TCTCCCTGCT ATTTTAAACG CTCTTGAAGT TAAGGATCAT GCTGGTGGTC 50 

GTCTTGTTCT TGAAGTTGCT CAACACTTGG GTGAAAACAC TGTACGTACT 100 

ATTGCTATGG ATGGTACTGA AGGTAAGTTT ATTTTTAGAT GATCATAAAT 150 

50 AATTGATCAT AATGATAAAA AAAAAAGAAG AAGAAGAACA GGATGTATAT 200 

AATGGTTAAT AAATAATATT TTCATATTGK ATATAACTAT TTAATCTGTT 250 

TTTTTTTCTT CATGATTATA TATATATATG TMCTAATATC TAATATGAAC 3 00 

CTTTTTTATA AAATTAATCA GGTCTTGTCC GTGGTCAAAA GGTTGTTGAT 350 

ACTGGTGCTC CTATTAC CAT TCCTGTTGGT AAGGAAGTTC TTGGTCGTAT 4 00 

55 CATCAACGTT ATTGGTGAAC CCATTGATGA ACGTGGTCCT ATTAAGGCCA 450 

AGTCTCACCG TGCTATTCAC GCTGAAGCTC CAGAATTCGT TGATCAATCT 500 

CCTACTCCCG AAATTCTTGA AACTGGTATT AAGGTTGTCG ATTTATTAGC 550 

TCCTTATGCT CGTGGTGGTA AGATTGGTCT TTTCGGTGGT GCTGGTGTAG 600 

GTAAAACTGT CTTGATTCAA GAACTTATTA ACAACATTGC TAAAGCCCAT 650 

60 GGTGGTTACT CTATTTTCTG TGGTGTTGGT GAACGTACTC GTGAAGGTAA 700 

385 
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CGATTTATAC CACGAAATGA TGGAAACTGG TGTCATTAAA CTTGAAGGTG 750 

ACTCCAAGTG TGCTCTTGTA TTCGGTCAAA TGAACGAACC TCCTGGTGCT 800 

CGTGCCCGTG TTGCTTTAAC TGGTTTAACC ATTGCTGAAT ACTTCCGTGA 850 

TGAAGAAGGT CAAGATGTGT TACTTTTCAT TGATAACATT TTCCGTTTCA 900 

CTCAAGCTGG TTCTGAAGTA TCTGCCCTTT TAGGTCGTAT TCCATCTGCT 950 

GTAGGTTACC AACCCACTTT ATCTACTGAT ATGGGTGGTA TGCAAGAACG 1000 

TATTACTACT ACCAAGAA 1018 



2) INFORMATION FOR SEQ ID NO : 681 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681 
GGISSITTYG GIISIGGIAA RAC 23 

2) INFORMATION FOR SEQ ID NO : 682 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682 
GT I AC I GGYT CYTCRAARTT ICCICC 26 

40 

2) INFORMATION FOR SEQ ID NO: 683 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 
55 GTIACIGGIT CISWIAWRTC ICCICC 26 



60 



2) INFORMATION FOR SEQ ID NO: 684 

386 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3267 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA . 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Candida tropical is 

(C) ACCESSION NUMBER: M64984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684 

15 ATGGCTGGTG CTTTAGAAAA CGCAAGAAAA GAAATTAAAC GTCTTTCATT 50 

AGATGACACC AATGAATCCC AATATGGTCA AATCTATTCT GTTTCCGGTC 100 

CGGTTGTTAT TGCCGAAAAC ATGATTGGAT GTGCCATGTA CGAATTGGTT 150 

AAAGTTGGTC ATGATAATTT AGTTGGGGAA GTTATTAGAA TTAATGGTGA 2 00 

TAAAGCAACC ATTCAAGTTT ATGAAGAAAC TGCAGGGGTC ACTGTTGGTG 250 

20 ATCCAGTTTT AAGAACTGGT AAACCATTAT CTGTTGAATT AGGTCCTGGT 300 

TTAATGGAAA CTATTTATGA TGGTATTCAA AGACCTTTAA AAGCCATTAA 350 

AGATGAATCC CAATCTATTT ATATCCCAAG AGGTATTGAT GTTCCTGCTT 400 

TATCAAGAAC TGTTCAATAT GATTTCACTC CAGGTCAATT GAAAGTTGGT 450 

GATCATATCA CTGGTGGGGA CATTTTTGGT TCTATTTATG AAAACTCTTT 500 

25 ATTGGATGAC CATAAGATTT TGTTACCTCC AAGAGCAAGA GGTACTATTA 550 

CTTCTATTGC TGAAGCCGGT TCTTATAATG TTGAAGAACC AGTTTTGGAA 600 

GTTGAATTTG ATGGTAAGAA ACATAAATAC TCTATGATGC ATACATGGCC 650 

AGTTAGAGTT CCAAGACCAG TTGCTGAAAA ATTGACTGCT GATCATCCAT 700 

TGTTGACCGG TCAAAGAGTC TTGGATTCTT TATTCCCATG TGTTCAAGGT 750 

30 GGTACTACTT GTATCCCAGG GGCTTTTGGT TGTGGTAAAA CTGTTATTTC 800 

TCAATCTTTG TCCAAATTCT CCAACTCTGA TGTTATTATC TATGTTGGTT 850 

GTTTCACTAA AGGTACTCAA GTCATGATGG CTGATGGTGC CGACAAATCT 900 

ATTGAATCTA TTGAAGTTGG TGACAAAGTC ATGGGTAAAG ATGGTATGCC 950 

AAGAGAAGTT GTTGGCTTAC CAAGAGGTTA TGATGATATG TACAAGGTTC 1000 

35 GTCAACTTTC TAGTACTAGA CGTAATGCTA AATCCGAAGG CTTGATGGAT 1050 

TTCACTGTTT CTGCTGATCA TAAACTTATC TTGAAAACTA AACAAGATGT 1100 

CAAGATTGCT ACACGTAAAA TTGGTGGCAA CACCTATACT GGTGTTACTT 1150 
TCTATGTTTT GGAAAAGACT AAGACTGGTA TTGAATTAGT TAAAGCCAAG . 1200 

ACTAAAGTTT TCGGTCATCA TATCCATGGT CAAAATGGCG CTGAAGAAAA 1250 

40 AGCTGCTACT TTTGCTGCTG GCATTGACTC TAAAGAATAC ATTGATTGGA 1300 

TCATTGAAGC TAGAGATTAT GTACAAGTTG ATGAAATTGT CAAGACCAGC 1350 

ACCACTCAAA TGATCAACCC AGTTCATTTT GAATCTGGTA AACTCGGTAA 1400 

CTGGTTACAC GAACACAAGC AAAACAAATC ACTTGCTCCA CAATTGGGTT 1450 

ACTTGTTGGG TACTTGGGCT GGTATTGGAA ATGTTAAATC TTCTGCTTTC 1500 

45 ACCATGAACT CCAAAGATGA TGTTAAATTA GCTACAAGAA TTATGAACTA 1550 

CTCTTCAAAA TTGGGCATGA CTTGTTCTTC TACTGAATCC GGTGAACTCA 1600 

ATGTCGCTGA AAACGAAGAA GAATTTTTCA ATAACCTTGG TGCTGAAAAG 1650 

GATGAAGCTG GTGATTTCAC TTTTGATGAA TTTACCGATG CTATGGATGA 1700 

ATTGACTATC AATGTTCATG GTGCAGCTGC AAGCAAGAAG AACAATTTGT 1750 

50 TGTGGAATGC TTTGAAATCT CTTGGTTTCA GAGCCAAGTC TACTGATATT 1800 

GTCAAGAGTA TTCCTCAACA TATTGCTGTT GATGATATTG TTGTCAGAGA 1850 

ATCTTTGATT GCCGGTTTAG TTGATGCTGC TGGTAATGTT GAAACCAAAT 1900 

CCAATGGTTC TATTGAAGCT GTTGTTAGAA CTTCTTTCAG ACATGTCGCT 1950 

AGAGGTCTTG TCAAGATTGC TCATTCTTTG GGTATTGAAT CATCTATTAA 2000 

55 TATTAAAGAT ACTCACATTG ATGCTGCTGG TGTTAGACAA GAATTTGCTT 2050 

GTATTGTCAA TTTGACTGGT GCTCCACTTG CTGGTGTTCT TTCTAAATGT 2100 

GCACTTGCAA GAAACCAAAC TCCAGTTGTC AAATTTACCA GAGACCCAGT 2150 

TTTGTTCAAC TTTGATTTGA TCAAATCTGC AAAAGAAAAC TATTATGGTA 2200 

TTACTTTGGC TGAAGAAACT GATCATCAAT TCCTTTTATC CAACATGGCC 2250 

60 TTGGTGCACA ACTGTGGTGA ACGTGGTAAT GAGATGGCTG AAGTTTTGAT 2300 

387 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GGAATTCCCA GAATTGTTTA CTGAAATTTC TGGTAGAAAA GAACCAATTA 2350 

TGAAACGTAC CACTTTGGTT GCCAATACTT CTAATATGCC AGTCGCTGCC 2400 

AGAGAAGCTT CTATTTATAC TGGTATTACA TTGGCTGAAT ATTTCAGAGA 2450 

TCAAGGTAAG AATGTTTCTA TGATTGCTGA TTCTTCTTCA CGTTGGGCTG 2500 

5 AAGCTTTGAG AGAAATTTCT GGTAGATTGG GTGAAATGCC TGCTGATCAA 2550 

GGTTTCCCAG CTTATTTGGG TGCTAAATTG GCTTCTTTCT ATGAGCGTGC 2600 

CGGTAAAGCC ACTGCTTTGG GTTCACCAGA TAGAGTTGGT TCAGTTTCTA 2650 

TTGTTGCTGC TGTTTCTCCA GCTGGTGGTG ATTTCTCTGA TCCAGTTACT 2 700 

ACTTCTACTT TGGGTATTAC TCAA.GTTTTC TGGGGGTTGG ATAAGAAATT 2 750 

10 GGCCCAAAGA AAACATTTCC CATCTATTAA CAC CAGTGTT TCTTATTCTA 2800 

AATACACCAA TGTTTTGAAC AAATACTATG ATTCCAACTA TCCAGAATTC 2 850 

CCACAATTGA GAGACAAAAT TAGAGAAATT TTATCTAATG CTGAAGAATT 2 900 

GGAACAAGTT GTTCAATTAG TTGGTAAATC TGCATTGTCT GATTCTGATA 2 950 

AGATTACTTT AGATGTTGCT ACCTTGATTA AAGAAGATTT CTTGCAACAA 3 000 

15 AATGGTTATT CTTCATATGA TGCATTCTGT CCAATTTGGA AGACTTTTGA 3 050 

TATGATGAGA GCATTTATTT CATATTATGA TGAAGCACAA AAAGCAATTG 3100 

CCAATGGTGC TCAATGGTCT AAATTAGCTG AAAGTACTAG TGATGTTAAk 3150 

CATGCTGTTT CTTCAGCTAA ATTCTTTGAA CCATCAAGAG GTCAAAAAGA 32 00 

AGGTGAAAAA GAATTTGGAG ATTTATTAAC CACTATCTCC GAAAGATTTG 3250 

2 0 CTGAAGCTTC AGAATAA 32 67 



25 



2) INFORMATION FOR SEQ ID NO: 685 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 5 (A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 9790 

(C) ACCESSION NUMBER: D17462 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685 



TTGCAAATTG GAAAAATCAT AAAAGTCTCC GGTCCTCTCG TTATGGCAGA 50 

AAATATGTCA GAAGCAAGTA TTCAAGACAT GTGTTTAGTG GGAGATTTAG 100 

GAGTCATCGG CGAAATCATT GAGATGCGTC AAGATGTGGC GTCTATTCAA 150 

GTATATGAAG AAACTTCAGG AATTGGTCCC GGAGAACCTG TTCGTTCCAC 2 00 

45 TGGGGAAGCA CTATCTGTTG AGCTAGGACC AGGAATCATT TCACAAATGT 250 

TTGACGGGAT TCAAAGACCA CTGGATACAT TTATGGAAGT GACTCAAAGT 3 00 

AACTTCTTAG GACGTGGGGT CCAATTACCA GCTTTAGATC ATGAGAAACA 350 

ATGGTGGTTT GAAGCGACAA TCGAAGAAGG AACAGAAGTA AGTGCTGGAG 4 00 

ACATCATTGG GTACGTGGAT GAAACGAAGA TCATTCAGCA CAAAATCATG 450 

50 GTCCCTAATG GTATCAAAGG AACTGTACAA AAAATTGAAT CTGGATCATT 500 

TACGATCGAT GATCCGATTT GTGTGATCGA AACGGAACAA GGCTTAAAAG 550 

AGCTGACGAT GATGCAAAAA TGGCCAGTAC GTCGTGGTCG ACCAATCAAA 600 

CAAAAATTAA ATCCAGATGT ACCGATGATC ACCGGTCAAA GGGTCATTGA 650 

CACGTTTTTC CCAGTAACTA AAGGAGGAGC GGCAGCCGTT CCAGGTCCGT 700 

55 TTGGTGCAGG GAAGACAGTT GTGCAACACC AGATTGCTAA GTGGTCGGAC 750 

GTAGATCTAG TGGTTTACGT TGGTTGTGGG GAACGAGGAA ATGAAATGAC 800 

GGATGTCGTC AATGAATTTC CTGAACTGAT CGATCCAAAT ACAGGCGAGT 850 

CTTTGATGGA ACGAACTGTG TTGATCGCTA ATACATCGAA CATGCCAGTA 900 

GCTGCTCGAG AAGCTTCTAT TTATACGGGA ATCACGATTG CCGAGTACTT 950 

60 CCGTGACATG GGGTATGATG TAGCAATCAT GGCAGATTCC ACTTCTCGTT 1000 

388 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GGGCAGAAGC 
GATGAAGGTT 
ACGTTCAGGA 
TCACTGCCAT 
5 GTGACTCAAA 
TAGTCTTGCT 
ACTCATTATA 
CAGGATTGGT 
AGAACAATTA 

10 ATAACGATCG 
TTACAACAAA 
ACAATTCAAC 
AAGCCTTATC 
GCGGTCAGAG 

15 AGCCAAAATT 
TTGTTTCAGA 



ACTGCGTGAA 
ATCCCGCTTA 
CGTGTGATTG 
CAGTGCGGTT 
ATACCTTACG 
CAAAAAAGAC 
TTCAACAGAA 
CTGATATGGT 
AATGAAATCG 
CTTGACCCTT 
ATGCTTTTGA 
ATGTTGAAAG 
GTTGGGAGCG 
AACGCATTAG 
AGTAGTATAA 
AGGAGGGATG 



ATGAGCGGAC 
TCTGGGCTCT 
CTCTAGGCTC 
TCTCCTTCTG 
TGTGGTGAAG 
ATTTTCCATC 
GTTGGCAGAT 
AACTGAAGGC 
TGCGCTTGGT 
GAAGTAGCAA 
TGATGTAGAT 
TTATTTTGAC 
TACTTCAATG 
TCGGAGCAAG 
ATGAAGAAAT 
ACCGATGATT 



GTTTAGAAGA 
CGTCTAGCTG 
TGACCAACGT 
GTGGAGATAT 
GTTTTCTGGG 
GATTAACTGG 
ATATGGATCA 
ATGCGGATCT 
AGGGATCGAT 
AATCGATTCG 
ACGTTTACTT 
TTTTGGGAAA 
AAATCATGGA 
TATATTCCAG 
CAAAGAAACG 
AA 



PCT/CAOO/01150 

AATGCCTGGT 105CT 

AATACTATGA 110 0 

GAGGGCAGTA 1150 

CTCTGAACCA 1200 

GATTAGATTC 1250 

ATCCAAAGTT 1300 

AATCTTACAA 1350. 

TGCAAGAAGA 1400 

TCGCTTTCTG 1450 

AGAAGACTAT 1500 

CAAGAGAAAA 1550 

GAAGCTCGAA 1600 

AGGTACAGTA 1650 

AAGAAGAGTT 1700 

ATCCAATTGA 1750 
1782 



2 0 2) INFORMATION FOR SEQ ID NO: 686 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 81 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 029 

(C) ACCESSION NUMBER: Genome project 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 6 

CAGGGACATG TTATAGAAGC TTATGGAAAC TTGTTACGTG TACGCTTTGA 50 

CGGATATGTT AGACAAGGTG AAGTTGCATA TGTCAACGTA GATAATACCT 100 

GGTTAAAAGC AGAAGTGATT GAAGTTGCTG ATCAAGAAGT CAAGGTTCAG 150 

40 GTATTTGAAG ATACACAAGG CGCGTGTCGA GGAGCTCTTG TTACGTTTTC 2 00 

AGGACATCTT TTAGAAGCCG AGTTAGGGCC TGGCTTGCTT CAGGGCATTT 250 

TCGATGGACT TCAAAATCGT CTTGAGGTGC TAGCTGAAGA TAGTTCTTTC 300 

TTGCAGAGAG GCAAGCATGT TAATGCTATT TCTGATCATA ATTTATGGAA 350 

TTATACTCCC GTAGCTTCTG TTGGGGATAC TTTAAGACGA GG AG AT CTTC 400 

45 TAGGAACAGT ACCTGAAGGA CGATTTACTC ATAAGATTAT GGTTCCTTTT 450 

TCTTGCTTTC AAGAGGTTAC CCTGACTTGG GTAATTTCTG AAGGAACCTA 500 

TAATGCTCAT ACTGTGGTCG CAAAAGCTCG AGATGCTCAG GGTAAAGAAT 55 0 

GTGCCTTTAC TATGGTGCAA AGATGGCCGA TCAAACAAGC TTTTATTGAA 60 0 

GGAGAGAAGA TCCCTGCGCA TAAGATTATG GATGTGGGTT TGCGAATCTT 650 

50 AGATACGCAA ATTCCAGTAT TGAAGGGGGG AACTTTCTGT ACCCCAGGAC 700 

CTTTTGGTGC AGGGAAAACA GTCTTACAAC ACCATCTTTC TAAGTACGCT 750 

GCTGTAGATA TTGTGATTTT GTGTGCGTGC GGAGAGCGTG CTGGTGAAGT 800 

TGTTGAGGTA TTACAAGAGT TCCCTCATCT TATCGACCCC CATACCGGAA 850 

AGTCTTTAAT GCACAGAACA TGTATTATTT GTAACACATC ATCCATGCCT 900 

55 GTGGCTGCCC GAGAGTCTTC GATCTATTTA GGAGTGACGA TTG CAGAATA 950 

CTATCGCCAG ATGGGACTAG ATATTCTGCT TTTAGCTGAT TCTACATCCC 100 0 

GATGGGCACA AGCCCTTAGA GAGATTTCGG GACGTCTTGA AGAAATCCCT 1050 

GGAGAGGAAG CATTTCCTGC ATACCTGTCT TCTAGAATAG CTGCTTTTTA 1100 

TGAGCGAGGA GGAGCTATCA CCACGAAAGA TGGTTCTGAA GGATCTTTAA 1150 

60 CTATATGTGG TGCGGTGTCT CCTGCAGGAG GAAACTTTGA AGAACCAGTC 1200 

389 



SUBSTITUTE SHEET (RULE 26) 
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ACTCAATCTA 
ACGAGCTGAC 
CAAAATATTT 
TGGGGTGGTG 
5 AATCGGCAAG 
ACATGGAAAT 
CAGAACGCAT 
AGAGTTATTT 
ATAGTCCTGA 
10 AAGACATTAA 
AGAGGTCATA 
GATATGCAAA 



CATTAGCTGT 
GCACGTAGGT 
GAACCAGGTA 
CTGTGAAAAA 
CGTATGGAAG 
CTACTTAAAG 
TCGATCCTGT 
TCATTAATCA 
TGATGCAAGA 
ATGGCCTGAA 
GTTAGACTGT 
CAATCTACAC 



AGTCGGAGCG 
ATC CTTCAAT 
GGACAAATTT 
AGCAGCACAG 
TTGTCAGTGA 
GCAGAACTTT 
GGACTGTTAT 
GTCGTATTTT 
AGCTTTTTCC 
ATTTCTTTCA 
TGGAAAAAAC 
AAAAATAACT 



TTCTGTGGTC 
AGACCCTTTG 
TAGAAGAGAA 
TTTCTAGAGA 
AGAAGGGGTT 
ATGATTTTTG 
TGTCCTTTTG 
TGATGCTAAA 
TTGAGCTGCA 
GAGGAATATC 
AATGGTACAA 
G 



PCT/CA00/01150 

TTTCAAAAGC 12 50 

ATTTCTTGGT 1300 

GGTTTCAGGC 1350 

AAGGTTCAGA 1400 

TCTATGGAAG 1450 

TkATCTCCAG 1500 

AGAGACAGAT 1550 

TTTGTTTTTG 16 00 

GAGCAAGATT 1650 

ATGAGAGTAA 170 0 

ATGGCGTAAG 1750 
1781 



15 



20 



25 



30 



2) INFORMATION FOR SEQ ID NO: 687 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1758 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Halobacterium salinarum 
(C) ACCESSION NUMBER: S56356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 



ATGAGTCAAG 
CGGTCCGGTC 
TCTACGTGGG 

35 GACGTAACCA 
CCAGCCCGTC 
GGATGCTGGA 
GAAGACGAGA 
CGACCTCGAC 

40 AGGTCGCGGC 
GAACACAAGG 
CGTCGAATCC 
CCGGCGAGGA 
CCCACCGTCG 

45 CATCCTCGAC 
CGGGGCCGTT 
TTCGCCGACG 
CGAGATGACG 
CCGGGAACCC 

50 ATGCCGGTTG 
GGAGTACTAC 
CCTCGCGGTG 
ATGCCCGGCG 
GTTCTACGAG 

55 CCATCTCGGT 
CCGGTCACCC 
CTCGGACCTC 
CCTACAGCCT 
GTCGACGACT 

60 GGAATCCGAA 



CTGAAGCAAT 
GTGACCGCCA 
CGACGAGGGT 
CCATCCAGGT 
GACAACACGG 
CTCCATCTAC 
TGGGGGCGTT 
ACCGACTGGG 
CGGCGATGTC 
TTCTGGTGCC 
GGCACGTTCA 
GATCCAGATG 
ACAAGCAGAC 
GGCCTGTTCC 
CGGGTCCGGG 
CGGACATCGT 
GAAGTCATCG 
GCTGATGGCC 
CCGGGCGTGA 
CGCGACATGG 
GGCGGAGGCC 
AGGAGGGGTA 
CGCGCCGGCT 
CATCGGTGCG 
AGAACACGCT 
GCCGAGCGCC 
CTACAAGGAC 
GGGCCGAGCA 
CTCGAAGAGA 



CACTGACACC GGCGAAATCG 
CGGGCCTCGA CGCGCAGATG 
CTGATGGGCG AGGTCATCGA 
CTACGAGGAG ACGTCCGGCA 
GCGAACCGCT CACCGTGGAC 
GATGGTGTGC AGCGTCCACT 
CCTCGATCGC GGTGTTGACG 
AGTTCGAGCC CACCGTCGAG 
GTCGGCACCG TCGACGAAAC 
CCCCCGCAGT GACGGCGGCG 
CCGTCGACGA CACGGTCGTC 
CACCAGGAGT GGCCGGTCCG 
GCCGACGGAG CCGCTGGTGT 
CGATCGCGAA AGGCGGGACG 
AAGACGGTCA CCCAGCAGTC 
TGTCTACATC GGCTGTGGTG 
AGGACTTCCC GGAGCTGCCC 
CGCACCACGC TCATCGCGAA 
GTCCTGCATT TACACGGGAA 
GCTACGACGT GGCGCTGATG 
ATGCGGGAGA TCTCCTCGCG 
TCCCGCGTAC CTGGCCGCCC 
ACTTCGAGAA CTTCAACGGG 
GTGTCGCCGC CCGGCGGGGA 
GCGCATCGTG AAGACGTTCT 
GGCACTTCCC GGCGATCAAC 
CAACTCGACC CGTGGTTCAC 
GCGCCAGTCG GCGGTCGACA 
TCGTGCAGCT CGTCGGGAAG 
390 



AGAGCGTGAG 50 

AACGACGTCG 100 

GATCGAAGGC 150 

TCGGGCCGGG 200 

CTGGGCCCGG 250 

GGACGTCCTC 3 00 

CACCCGGCAT 350 

GCGGGCGACG 400 

GGTCAGCATC 450 

AAGTCGTCGC 500 

GAGTTGGACA 550 

CCGCCAGCGC 600 

CCGGCCAGCG 650 

GCCGCGATCC 700 

CCTCGCGAAG 750 

AGCGCGGCAA 800 

GACCCCCAGA 850 

CACGTCGAAC 900 

TCACCATCGC 950 

GCCGACTCCA 1000 

ACTCGAGGAG 1050 

GCCTCTCGGA 1100 

ACCGAGGGCT 1150 

CTTCTCCGAG 1200 

GGGCGCTTGA 1250 

TGGGACGAGT 1300 

GGACAACGTC 1350 

TCCTCGACGA 1400 

GACGCGCTGC 1450 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CCGAGGACCA GCAGCTCACG CTGGAAGTCG 
TGGCTCCAGC AGAACGCGCT CCACGACGTG 
GAAGACGTAC GCCATCCTCT CCGGCATCAA 
TTGAGGCGTT GGACGCCGGT GTGCCAGTCG 
5 GCCGCGCCGC GCCTGAATCG TCTCGGCACG 
GGCGGAGGTC GCGGAGATCA AACAGCAGAT 
TCTACTGA 



PCT/CA00/01150 

CGCGGTACAT CCGCSAGGCG US 00 

GATCGCTACT GCCCGCCCGA 1550 

GACGCTTCAC GAGGAGTCCT 1600 

AGGAGATCAC GTCGATCGAC 1650 

ACGCCCGACG ACGAGCACGA 1700 

TACCGAGCAG CTTCGGGAGC 1750 

1758 



10 



2) INFORMATION FOR SEQ ID NO: 688 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3118 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human 

(C) ACCESSION NUMBER: L09234 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688 

GAATT CCGGC AGCTGACTAG TCTTGTGATT GGGGTCCTGG GCTGATAAAA 50 

TCATTCCAAA TGACGAGCAC ATTGATAAAG ACGTCCGATG AGGACCGGGA 100 

GTCCAAATTC GGCTTTGTTT TTGCCGTATC TGGACCTGTG GTGACAGCTG 150 

3 0 AACGAATGGC CGGTTCTGCT ATGTACGAAC TGGTGCGTGT CGGTTATTAT 200 

GAACTGGTCG GAGAGATCAT CCGGTTGGAG GGTGACATGG CAACAATCCA 250 

AGTATACGAA GACACCTCAG GTGTGACAGT AGGCGATCCC GTGCTGCGCA 300 

CAGGCAAGCC GCTGTCCGTG GAACTGGGAC CCGGAATCAT GGGCAGCATC 350 

TTCGACGGTA TCCAGCGACC GCTGAAGGAT ATCAATGAAC TGTCAAATAG 400 

3 5 TATCTACATC CCGAAAGGTG TCAATGTGCC TGCCCTGAGT CGCACTGCAC 450 

AGTGGGACTT CAGTCCCGTC AGTGTCAAGG TTGGAAGCCA CATTACTGGT 500 

GGTGACCTGT ACGGTTTGGT CCACGAAAAT ACTCTGGTGA AACACAAGTT 550 

GCTGCTGCCG CCCCGTGCCA AGGGAACTGT CACGTACATT GCAGAACCTG 600 

GAAACTACAC AGTTGATGAT GTTGTCCTGG AGACAGAATT TGACGGCGAG 650 

40 CGATCAAAGT TCACCATGCT GCAAGTGTGG CCTGTACGTC AGCCCAGGCC 700 

TGTTACAGAA AAGTTGCCAG CTAACTACCC CCTCCTTACT GGCCAGCGTG 750 

TGCTCGACTC CCTATTCCCG TGTGTCCAGG GTGGAACAAC AGCTATTCCT 800 

GGGGCCTTCG GATGTGGCAA GACTGTAATA TCACAGTCTT TGTCAAAATA 850 

CTCAAACTCC GATGTAATTA TCTATGTAGG TTGTGGTGAG CGAGGTAATG 900 

45 AAATGTCAGA AGTACTCAGG GATTTCCCGC AGTTGTCGTT GGAGATTGAT 950 

GGTGTGACTG AATCAATCAT GAAGAGAACA GCCCTGGTCG CAAACACATC 1000 

AAACATGCCT GTGGCTGCTC GAGAAGCATC TATCTACACA GGTATTACAC 1050 

TGTCAGAATA CTTCAGGGAC ATGGGTTACA ATGTATCCAT GATGGCTGAC 1100 

TCAACTTCAC GATGGGCCGA AGCTCTTCGA GAAATCTCAG GTCGATTGGC 1150 

50 TGAAATGCCT GCCGACAGCG GTTATCCCGC CTACCTAGGT GCACGACTTG 1200 

CCAGTTTCTA CGAGCGTGCC GGCCGTGTGA AGTGCTTGGG TAACCCAGAC 1250 

AGGGAGGGCT CCGTGAGTAT AGTGGGCGCC GTGTCGCCGC CCGGTGGAGA 1300 

CTTCTCAGAT CCCGTGACGA CGGCCACACT AGGTATCGTC CAGGTGTTCT 1350 

GGGGTCTCGA CAAGAAACTT GCCCAGCGAA AGCACTTCCC ATCCATCAAC 1400 

55 TGGCTCATCT CGTACAGTAA ATACATGCGT GCTCTGGATG ACTTCTACGA 1450 

CAAGAATTTC CCAGAGTTTG TCCCACTGCG TACAAAGGTG AAGGAGATTT 1500 

TGCAGGAGGA AGAAGACCTG TCTGAAATTG TGCAGTTGGT CGGTAAAGCT 1550 

TCATTGGCAG AAACTGACAA GATCACACTT GAGGTTGCCA AACTATTAAA 1600 

GGATGATTTC CTGCAACAGA ACAGCTATTC ACCATATGAC CGTTTCTGCC 1650 

60 CATTCTACAA GACAGTAGGA ATGCTGAAAA ATATGATTGC TTTCTACGAT 1700 

391 
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PCT/CA00/01150 



ATGTCTCGGC ATGCAGTTGA ATCTACTGCT CAGAGCGAGA ACAA&ATCAC 175^0 

TTGGAATGTT ATTAGAGATT CTATGGGCAA TATTCTGTAT CAGCTTTCCT 180 0 

CCATGAAATT CAAGGATCCA GTCAAGGATG GAGAAGCGAA GATCAAGGCA 1850 

GACTTTGAGC AGCTTCATGA AGACATTCAG CAAGCCTTCA GGAACCTGGA 1900 

5 GGATTAAAGT GGTAGCTGCC AGTGGTTCTC TCGGTGCAGT TGTCACATTT 1950 

GGCAAGCTCT GTAGGGTTGC CGAGTGGCAT CGGTGCTAGA CACCTGAGCA 200 0 

TTCCTTTGCC ACATAAAGAC TAAAGCAGGT GGAATTTCAG TTGTAAAAAG 2050 

CTGGTTCCAT TGGTGCTAAG ATTATGTTGT GCCCTTTTCT GCTTCTCACA 2100 

TTCCAACAGA GGAATTTACT TCCAGTTTTC TTCCATTTTC CTCCTCATTT 2150 

10 TAAGTGTCGG TACAGAGGCA ATAATCTGAT AACTCTGTAC CGTCACTTAC 2200 

AAGCAGGGAG AATTTGTAAT TATTACAAAT CCCATTATCT CTGTGCACCA 2250 

CAGCCTTGTA AATTCATTTG TCCCAGGACT CCCTCTTGTG TGTACGTGAG 23 00 

ATTGCCGTCT GTATGTATGT ACACACCGTA CTGCAGTATT TGAAGTCAGT 2350 

CAGAAGGTGA ATTACACCAC TTACTCATTG TGTCACGTAG CAAGTGTGCA 2400 

15 AACTGCCATC CATTGTCCTA TTTATTCACA TAACTAGTTT TCTTTGCATT 2450 

TCCAGTGTTG CAAATTGTGT TTAGAAAATT ATGCCATCGA GACTGGTCGA 2500 

ACCTCACATT GTAACTCAGT ATTTACACAC ACGTTTACTT GCTACAGAAA 2550 

TGTAGAAAAA ATAATTGTTG TATATTGAAA GTACAAGTGA CAAAGTTGCA 2600 

TTTAAAATGG TGAATGTATT TTATATTTCT TTTGTAGACA CAAGAGTTAA 2650 

20 TGCATTTTGC TTAATGGAGA TGTATGTAAA CCTAAAATAG CAGTTTGTGC 270 0 

ACAAATTATG TATATGTGAA ATGGAGATGG TTTCTAATTT GCTGATTGAT 2750 

TGCCAGTATT AATTTAAACA ACTGTAGTTG TGGGATGTAG TGGGAAGATT 2800 

TTTTTTTTCC TATAAAATTG GTGGATGTAT GTGTCGGAGA TTTTGATTGT 2 850 

ATGTGTAAAA TAGTGATCCC AGTAACTGTA AAGCTTTAGA ATACAGTTAC 2900 

25. TGACTGTATA GTTGTACAGG TGTTGTTACT TTTAAGAATT TATTGACACA 2 950 

AAGGTGAAAG TCTATTATTG TATTGTAATG TTTAAAGCAT TTAAGGTTTA 3 00 0 

AAAATCCTAC TTCTGTGTAT AAATGTTACC ATTCCTCATA TAACATAACT 3050 

GTGTAGAAAT ACAGTCAACT TCATGTTCAT TAGCATTTCA CTGTTGTCAC 3100 

ATAAATTATG CCCGGAAT 3118 

30 



2) INFORMATION FOR SEQ ID NO: 689 

35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 183 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmodium falciparum 
45 (B) STRAIN: 3D7 

(C) ACCESSION NUMBER: L08200 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 9 

50 ATGACAAAAG TTGCTGTTGA AAAAGAGGAA CCAGGAGTTG TTTATAAGGT 50 

GGCTGGTTCA TTAGTTATTG CTGAGAATAT GAGTGGAACT CGTATGTACG 100 

AGTTAGCTAA AGTAGGATGG AATAAATTGG TTGGAGAAAT TATTAGATTA 150 

GAAGGGAATT ATGCATATAT ACAAGTTTAT GAAGATACTT CAGGTTTATC 200 

TGTAGGAGAC CCTGTTATAA AAACAGGAAA TGCTTTATCA GTCGAATTGG 250 

55 GTCCTGGTAT TTTAGATAAT ATTTATGATG GTATTCAAAG AC CATTAGAA 300 

AGAATAGCAA ATGTGTGTGG TGATGTATAT ATATATAAAG GTATTGATAT 350 

GACATCTTTA GATCATGATA AACAATGGCA ATTTTATGCT GATAAGAAAT 400 

TAAAATTAAA TGATATTGTT ACTGGTGGAG ATATCTTTGG ATTTGTTGAT 450 

GAAAATAAAT TATTTAAAGA ACACAAAATT ATGGCTCCAC CTAATGCTAA 500 

60 AGGGAGGCTT ACATATATTG CTCCAGATGG ATCATATACT TTAAAAGATA 550 

392 



SUBSTITUTE SHEET (RULE 26) 
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AAATATTTGA ATTAGAATAT CAAGGAAAAA AATATACATA T&3TTTATCT 6 00 

CATTTATGGC CTGTTCGTGA TCCTAGACCT GTTTTAGAAA AGGTAACAGG 650 
GGATACTTTA TTATTAACAG GGCAAAGAGT TTTAGATTCG TTATTTCCAA 700 
CAGTTCAAGG AGGTACTTGT GCTATTCCTG GTGCATTTGG TTGTGGAAAA 750 
5 ACTTGTGTTT CTCAGGCCTT ATCAAAATAT TCTAATAGTG AAGTTATTAT 800 
ATATGTAGGA TGTGGTGAAA GAGGTAATGA AATGGCTGAA ATTTTATCCG 850 
ACTTTCCTGA ATTAACTACT AAAGTAGATA ATGAAGATGT AGGTATTATG 9 00 

CAAAGAACGT GTTTAGTTGC TAATACTTCT AACATGCCTG TCGCTGCAAG 950 

AGAAGCTAGT ATTTATACAG GTATTACTTT ATGTGAATAT TTCCGTGATA 10 0 0 

10 TGGGTTATAA TGCTACCATG ATGGCTGATA GTACAAGTAG ATGGGCAGAA 1050 

GCCTTAAGAG AAATTTCAGG ACGTTTAGCT GAAATGCCTG CAGATAGTGG 1100 

TTATCCAGCT TATTTAGGTG CTAGATTAGC TTCCTTTTAT GAACGTGCAG 1150 

GAAAAGTCAA ATGTATTGGT TCTCCATCTC GTATAGGATC CATTACAATT 12 00 

GTGGGTGCTG TGTCTCCACC AGGTGGTGAT TTCTCTGACC CTGTAACTAC 1250 

15 AGCAACCATG TCTATTGTTC AAGCATTTTG GGGGTTAGAT AAAAAACTAG 13 0 0 

CTCAAAGAAA ACATTTCCCT TCTGTTAATT GGTCTACATC CTTTTCAAAG 13 50 

TATGTCAGAC AATTAGAACA ATACTTTGAT AATTTTGATC AAGATTTCTT 14 00 

ATCTTTAAGA CAAAAAATTA GTGATATTTT ACAACAAGAA AGTGACTTGA 1450 

ATGATATTGT TCAACTAGTA GGAAAGGATT CATTATCAGA AGACCAAAAA 1500 

2 0 GTTGTTATGG AAGTAGCCAA AATTATTAGA GAAGATTTTC TTCAACAAAA 1550 

TGCATTTAGC GATTATGATT ATATGTGCCC ATTACAAAAA ACAGTTGGTA 1600 

TGATGAGAAT TATTTGCCAC TTTTATGCTC AATGCTTAAG AACATTACAA 1650 

GAATATGACT CAAGAGAAAG AAAAATTGGT TGGGGATCTA TATATAATAC 1700 

ATTAAGACCA ACTATAAATA AAATTACACA TATGAAATTT GAAAACCCAA 1750 

25 AAAATTCAGA TGAATATTTC AAAAAGTATT TTAAGGCACT TGAAGAAGAA 18 0 0 

ATAACAGTAG GTTTAAGAAA CTTGATGGAA AAATGA 183 6 



30 2) INFORMATION FOR SEQ ID NO: 690 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3216 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM : Sac char omyces cerevisiae 

(B) STRAIN: X2180-1A 

(C) ACCESSION NUMBER: J054 09 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690 

ATGGCTGGTG CAATTGAAAA CGCTCGTAAG GAAATAAAAA GAATCTCATT 50 

AGAAGACCAT GCTGAATCTG AATATGGTGC CATCTATTCT GTCTCTGGTC 100 

CGGTCGTCAT TGCTGAAAAT ATGATTGGTT GTGCCATGTA CGAATTGGTC 150 

50 AAGGTCGGTC ACGATAACCT GGTGGGTGAA GTCATTAGAA TTGACGGTGA 200 

CAAGGCCACC ATCCAAGTTT ACGAAGAAAC TGCAGGCCTT ACGGTCGGTG 250 

ACCCTGTTTT GAGAACAGGT AAGCCTCTGT CGGTAGAATT GGGTCCTGGT 3 00 

CTGATGGAAA CCATTTACGA TGGTATTCAA AGACCTTTGA AAGCCATTAA 350 

GGAAGAATCG CAATCGATTT ATATCCCAAG AGGTATTGAC ACTCGAGCTT 4 00 

55 TGGATAGGAC TATCAAGTGG CAATTTACTC CGGGAAAGTT TCAAGTCGGC 4 50 

GATCATATTT C CGGTGGTG A TATTTACGGT TCCGTTTTTG AGAATTCGCT 50 0 

AATTTCAAGC CATAAGATTC TTTTGCCACC AAGATCAAGA GGTACAATCA 550 

CTTGGATTGC TCCAGCTGGT GAGTACACTT TGGATGAGAA GATTTTGGAA 60 0 

GTTGAATTTG ATGGCAAGAA GTCTGATTTC ACTCTTTACC ATACTTGGCC 650 

60 TGTTCGTGTT CCAAGACCAG TTACTGAAAA GTTATCTGCT GACTATCCTT 700 
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TGTTAACAGG TCAAAGAGTT TTGGATGCTT TGTTTCCTTG TGTTCAAGGT J/50 

GGTACGACAT GTATTCCAGG TGCTTTTGGT TGTGGTAAGA CCGTTATCTC 800 

TCAATCTTTG TCCAAGTACT CCAATTCTGA CGCCATTATC TATGTCGGGT 850 

GCTTTGCCAA GGGTACCAAT GTTTTAATGG CGGATGGGTC TATTGAATGT 900 

5 ATTGAAAACA TTGAGGTTGG TAATAAGGTC ATGGGTAAAG ATGGCAGACC 950 

TCGTGAGGTA ATTAAATTGC CCAGAGGAAG AGAAACTATG TACAGCGTCG 1000 

TGCAGAAAAG TCAGCACAGA GCCCACAAAA QTGACTCAAG TCGTGAAGTG 1050 

CCAGAATTAC TCAAGTTTAC GTGTAATGCG ACCCATGAGT TGGTTGTTAG 1100 

AACACCTCGT AGTGTCCGCC GTTTGTCTCG TACCATTAAG GGTGTCGAAT 1150 

10 ATTTTGAAGT TATTACTTTT GAGATGGGCC AAAAGAAAGC CCCCGACGGT 1200 

AGAATTGTTG AGCTTGTCAA GGAAGTTTCA AAGAGCTACC CAATATCTGA 1250 

GGGGCCTGAG AGAGCCAACG AATTAGTAGA ATCCTATAGA AAGGCTTCAA 1300 

ATAAAGCTTA TTTTGAGTGG ACTATTGAGG CCAGAGATCT TTCTCTGTTG 13 50 

GGTTCCCATG TTCGTAAAGC TACCTACCAG ACTTACGCTC CAATTCTTTA 14 00 

15 TGAGAATGAC CACTTTTTCG ACTACATGCA AAAAAGTAAG TTTCATCTCA 14 50 

CCATTGAAGG TCCAAAAGTA CTTGCTTATT TACTTGGTTT ATGGATTGGT 1500 

GATGGATTGT CTGACAGGGC AACTTTTTCG GTTGATTCCA GAGATACTTC 1550 

TTTGATGGAA CGTGTTACTG AATATGCTGA AAAGTTGAAT TTGTGCGCCG 1600 

AGTATAAGGA CAGAAAAGAA CCACAAGTTG CCAAAACTGT TAATTTGTAC 1650 

20 TCTAAAGTTG TCAGAGGTAA TGGTATTCGC AATAATCTTA ATACTGAGAA 1700 

TCCATTATGG GACGCTATTG TTGGCTTAGG ATTCTTGAAG GACGGTGTCA 1750 

AAAATATTCC TTCTTTCTTG TCTACGGACA ATATCGGTAC TCGTGAAACA 18 00 

TTTCTTGCTG GTCTAATTGA TTCTGATGGC TATGTTACTG ATGAGCATGG 18 50 

TATTAAAGCA ACAATAAAGA CAATTCATAC TTCTGTCAGA GATGGTTTGG 1900 

25 TTTCCCTTGC TCGTTCTTTA GGCTTAGTAG TCTCGGTTAA CGCAGAACCT 1950 

GCTAAGGTTG ACATGAATGG CACCAAACAT AAAATTAGTT ATGCTATTTA 2 000 

TATGTCTGGT GGAGATGTTT TGCTTAACGT TCTTTCGAAG TGTGCCGGCT 2 050 

CTAAAAAATT CAGGCCTGCT CCCGCCGCTG CTTTTGCACG TGAGTGCCGC 2100 

GGATTTTATT TCGAGTTACA AGAATTGAAG GAAGACGATT ATTATGGGAT 2150 

30 TACTTTATCT GATGATTCTG ATCATCAGTT TTTGCTTGCC AACCAGGTTG 22 00 

TCGTCCATAA TTGCGGAGAA AGAGGTAATG AAATGGCAGA AGTCTTGATG 2250 

GAATTCCCAG AGTTATATAC TGAAATGAGC GGTACTAAAG AACCAATTAT 23 00 

GAAGCGTACT ACTTTGGTCG CTAATACATC TAACATGCCG GTTGCAGCCA 2350 

GAGAAGCTTC TATTTACACT GGTATCACTC TTGCAGAATA CTTCAGAGAT 24 00 

35 CAAGGTAAAA ATGTTTCTAT GATTGCAGAC TCTTCTTCAA GATGGGCTGA 2450 

AGCTTTGAGA GAAATTTCTG GTCGTTTGGG TGAGATGCCT GCTGATCAAG 2500 

GTTTCCCAGC TTATTTGGGT GCTAAGTTGG CCTCCTTTTA CGAAAGAGCC 2 550 

GGTAAAGCTG TTGCTTTAGG TTCCCCAGAT CGTACTGGTT CCGTTTCCAT 2600 

CGTTGCTGCC GTTTCGCCAG CCGGTGGTGA TTTCTCAGAT CCTGTTACTA 2650 

40 CTGCTACATT GGGTATCACT CAAGTCTTTT GGGGTTTAGA CAAGAAATTG 2700 

GCTCAAAGAA AGCATTTCCC ATCTATCAAC ACATCTGTTT CTTACTCCAA 2 750 

ATACACTAAT GTCTTGAACA AGTTTTATGA TTCCAATTAC CCTGAATTTC 2 800 

CTGTTTTAAG AGATCGTATG AAGGAAATTC TATCAAACGC TGAAGAATTA 2 850 

GAACAAGTTG TTCAATTAGT TGGTAAATCG GCCTTGTCTG ATAGTGATAA 2 9 00 

45 GATTACTTTG GATGTTGCCA CTTTAATCAA GGAAGATTTC TTGCAACAAA 2950 

ATGGTTACTC CACTTATGAT GCTTTCTGTC CAATTTGGAA GACATTTGAT 3 000 

ATGATGAGAG CCTTCATCTC GTATCATGAC GAAGCTCAAA AAGCTGTTGC 3050 

TAATGGTGCC AACTGGTCAA AACTAGCTGA CTCTACTGGT GACGTTAAGC 3100 

ATGCCGTTTC TTCATCTAAA TTTTTTGAAC CAAGCAGGGG TGAAAAGGAA 3150 

50 GTCCATGGCG AATTCGAAAA ATTGTTGAGC ACTATGCAAG AAAGATTTGC 32 00 

TGAATCTACC GATTAA 3216 



55 2) INFORMATION FOR SEQ ID NO: 691 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1860 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS : Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schizo sac char omyces pombet 

(B) STRAIN: 972 h- 

(C) ACCESSION NUMBER: S4 7814 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691 

ATGGCGGGAG GAATTGAACT GGCCAAGAAG GCTATCAGGA GCCTCAAAAA 50 

TTACGACGAG CATGAAAACC GATATGGATC TATTTTCAGC GTTTCTGGTC 100 

CTGTCGTTGT TGCAGCCAAT ATGCTTGGAT GTTCGATGTA CGAACTCGTT 150 

15 CGCGTTGGTC ATGAAGAACT AGTTGGTGAA GTAATT CGTA TCCATCAAGA 2 00 

TAAATGTACT ATTCAAGTAT ACGAAGAGAC GTCCGGTCTC ACTGTTGGTG 2 50 

ATCCTGTCCA ACGCACTGGA AAGC CATTAT CTGTTGAATT AGGTCCTGGT 3 00 

TTAGCTGAGA CTATTTATGA TGGTATCCAA CGTCCGTTAA AGCAAATTTT 350 

CGACAAATCT CAAAGTATTT ATATTCCTAG AGGTATTAAT ACAGAATCAC 400 

20 TTAATCGTGA GCATAAGTGG GATTTCACAC CAAATAAGGA TTTACGCATT 450 

GGCGATCATG TATCCGGTGG TGATGTTTTC GGTTCTGTAT TTGAAAACTC 500 

TCTTTTCAAT GATCATAAAA TTATGTTACC CCCTAGAGCC CGTGGTACCG 550 

TCACATATAT TGCTGAAGCT GGATCATACC ATGTTGATGA AAAACTTCTT 600 

GAAGTCGAGT TTAATGGCAA GAAACATTCT TTTAGTATGT TGCATACTTG 650 

25 GCCTGTCCGT GCTGCTCGTC CAGTTGCGGA CAACTTAACT GCTAATCAAC 700 

CTTTATTGAC TGGTCAACGT GTTTTGGATG CGTTATACCC CTGTGTTCAA 750 

GGTGGCACTA CTGCTATCCC CGGTGCCTTT GGTTGTGGTA AAACAGTTAT 800 

TTCACAATCT CTTTCTAAGT ACTCTAATTC TGATTTGATT GTTTACGTCG 85 0 

GTTGTGGTGA ACGTGGAAAC GAAATGGCAG AAGTGTTAAT GGATTTCCCA 900 

30 GAACTAACAA TTGATATTAA TGGTAAACCA GAGCCCATTA TGAAGCGTAC 950 

TACATTGGTA GCCAACACTT CTAACATGCC TGTCGCTGCT CGTGAAGCTT 1000 

CCATTTATAC CGGTATTACA CTTGCTGAAT ATTATCGTGA TCAAGGTAAG 1050 

AACGTTTCAA TGATGGCTGA TTCTACATCT CGTTGGGCTG AAGCTTTGCG 1100 

TGAAATTTCT GGTCGTTTGG CTGAGATGCC TGCCGATTCT GGTTATCCCG 1150 

35 CTTATTTGGG TGCCAAATTG GCTTCTTTTT ACGAACGTGC TGGTCGTGCT 1200 

CGTTGCTTGG GAAGTCCTGA CCGTGAAGGA ACAGTTTCAA TTGTTGGAGC 1250 

TGTTTCTCCT CCGGGTGGTG ATTTTTCTGA TCCTGTTACT AGTGCAACCT 1300 

TGGGAATTGT TCAAGTCTTC TGGGGTTTGG ACAAGAAATT GGCCCAACGT 1350 

AAACACTTTC CCTCAATCAA CACCTCTCTT TCCTATTCTA AATACATCAA 14 00 

40 TGCTTTGCAA CCTTGGTATG AGGAAAGAGT TCCAGGCTTT AATACTCTTC 1450 

GTGATCAAAT CAAACAGATC ATTCAACAAG AAGATTCCAT GTTGGAAATT 1500 

ATTCAGTTGG TTGGTAAGTC GGCTCTTTCT GAAACGGATA AAGTTACTTT 1550 

GGACATAGCC GGTATTATTA AGAATGACTT CTTACAACAA AACGGTTATT 1600 

CTGATTACGA TCGCTGTTGC CCTCTTTACA AGACTTATCA TATGATGCGA 1650 

45 AACATGATTG CTTACTACAC AAAGGCTAAA AGTGCCGTTG AAACTGGTAG 1700 

CGTTCCTTGG TCAAAGATTA AAGAAAGTAC TTCAGATATC TTTTATGAGT 1750 

TAACCTCGAT GAAATTCGAA AACCCTAATG AAGGCGAGAA GGAAATAGTC 1800 

GAACACTATG AAACTCTGCA CAAGAAGATT GAGGACAAGT TTCACACTCT 1850 

GACTGAGTAA I860 

50 

2) INFORMATION FOR SEQ ID NO: 692 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



60 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma congolense 
5 (B) STRAIN: IL3000 

(C) ACCESSION NUMBER: Z25814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692 

10 ATGACGAGCG ATAAAAACCC TTACAAAACA GAGCAGCGCA TGGGGGCCGT 50 

GAAGGCCGTC TCCGGGCCAG TTGTCATTGC TGAAAACATG GGCGGTAGCG 100 

CTATGTATGA GCTTGTGCAG GTAGGTTCCT TCCGGTTAGT GGGCGAGATC 150 

ATTCGTCTAG AGGGCGATAC CGCCACTATT CAGGTCTATG AGGAAACAGG 200 

TGGCCTCACT GTCGGAGACC CGGTGTACTG TACGGGTAAG .CCTCTTTCGC 250 

15 TTGAGCTTGG ACCTGGAATC ATGTCTGAAA TATTTGACGG TATCCAGCGG 3 00 

CCTCTTGACA CCATCTACCG CATGGTGGAA AACGTGTTTA TCCCCAGGGG 3 50 

CGTTCAGGTG AAGTCACTCA ATGACCAGAA ACAGTGGGAC TTTAAGC CAT 4 00 

GCCTGAAGGT TGGAGATCTT GTGTCTGGTG GTGATATCAT TGGCTCAGTG 450 

GTGGAGAACT CTCTCATGTA CAATCACAGC ATTATGATTC CGCCCAATGT 500 

2 0 GCGGGGCCGT GTTACTTCCA TTGTTCCTTC AGGAAATTAC ACCCTCCAAG 550 

ATGACATTAT TGAATTGGAA TATAATGGGA CAGTGAAATC ACTAAAACTT 600 

ATGCATCGCT GGCCAGTACG GACCCCGCGT CCTGTGGCGT CAAAAGAATC 650 

CGGCAATCAT CCGCTTCTCA CCGGACAGCG TGTGCTCGAT GCTCTCTTTC 70 0 

CATCCGTCCA GGGTGGAACA TGCGCCATCC CTGGCGCGTT TGGATGCGGA 750 

25 AAGACGGTTA TCAGTCAGGC TCTTTCGAAG TTCTCCAACA GCGACGCTGT 800 

TATCTATGTC GGCTGCGGCG AGCGTGGGAA TGAGATGGCA GAGGTGCTCA 850 

TGGACTTCCC CACACTCACC ACCGTTATTG ATGGTCGTGA GGAGTCCATC 90 0 

ATGAAGCGTA CCTGCCTGGT GGCAAACACC TCAAATATGC CTGTCGCTGC 950 

TCGTGAGGCG TCTATTTACA CTGGCATCAC TTTAGCTGAG TATTATCGTG 1000 

30 ATATGGGCAA GCACATTGCT ATGATGGCCG ACTCTACCTC TCGATGGGCT 1050 

GAGGCTCTCC GTGAGATCTC TGGGCGTCTC GCTGAAATGC CCGCTGATGG 1100 

TGGTTACCCT GCGTACCTCA GTGCGCGTCT TGCTTCCTTC TACGAGCGTG 1150 

CGGGGCGCGT GACATGCATC GGTGGGCCAA AACGCGAGGG CTCAGTAACC 12 00 

ATCGTTGGTG CCGTTTCTCC TCCTGGAGGT GACTTTTCTG ACCCAGTGAC 1250 

35 GTCCGCTACG CTTGGTATTG TGCAAGTCTT TTGGGGTCTT GAGAAGCGTC 13 0 0 

TTGCGCAACG TAAACACTTT CCTTCTGTTA ATTGGCTCAT TTCCTATTCA 1350 

AAATACCTTA ATGCTTTGGA GCCCTTCTTC AACACGCTTG ACCCTGACTA 140 0 

CATGCGCCTG CGGTCAGTTG CTGCGGAGAT CCTTCAGCGT GAGGAAGAGT 1450 

TGCAAGAAAT TGTTCAACTT GTCGGTAAGG ACTCACTTTC GGAGTCTGAC 150 0 

4 0 AAAATTATTC TAGAAACGGC TAAGGTTATT CGTGAAGAGT TTCTCCAGCA 1550 

GAATGCCTTT ACGCCGTACG ACAAGTATTG CCCGCCGTAC AAGACCTGCT 1600 

GGATGCTACG TAACATTGTC GCGTTCTACG AGGAGAGCCA GCGCGTTGTA 1650 

GCTGAGTCCG CTGGGGAACT TAAGATTACG TGGAACTACA TTCGTGAAAT 1700 

GATTCCTCAT ATTTACACGG GTTTAACTGA GATGAAGTTC CGTGATCCTC 1750 

45 AGGAGGGTGA GGAGGCCAAC GTAGAATTCT ACAGAAAACA AAATGAGGAA 1800 

ATTGTCAGCG CATTCGCCTC GCTGCTGCAA TAA 1833 



50 2) INFORMATION FOR SEQ ID NO: 693 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1758 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Thezmus thermophilus 

(B) STRAIN: HB8 

(C) ACCESSION NUMBER: D63799 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693 

ATGATCCAAG GGGTGATCCA GAAGATCGCG GGCCCGGCGG TGATCGCCAA 50 

GGGCATGCTC GGGGCCCGCA TGTACGACAT CTGCAAGGTG GGCGAAGAGG 100 

GCCTCGTGGG CGAGATCATC CGCCTGGACG GGGACACGGC CTTCGTCCAG 150 

GTCTACGAGG ACACCTCGGG CCTAAAGGTG GGGGAGCCCG TGGTCTCCAC 200 

GGGCCTTCCC TTGGCGGTGG AGCTCGGCCC CGGGATGCTG AACGGCATCT 250 

ACGACGGCAT CCAGCGCCCC CTGGAGCGCA TCCGGGAGAA GACGGGGATC 300 

TACATCACCC GGGGCGTGGT GGTCCACGCC CTGGACCGGG AGAAGAAGTG 350 

GGCCTGGACG CCCATGGTCA AGCCCGGGGA CGAGGTGCGG GGGGGTATGG 4 00 

TCCTGGGCAC GGTGC CCGAG TTCGGCTTCA CCCACAAGAT CCTGGTACCC 450 

CCGGACGTGC GGGGCCGGGT CAAGGAGGTG AAGCCCGCCG GGGAGTACAC 500 

CGTGGAGGAG CCGGTGGTGG TCCTCGAGGA CGGCACCGAG CTCAAGATGT 550 

ACCACACCTG GCCCGTTCGC CGGGCGAGGC CCGTGCAAAG GAAGCTTGAC 600 

CCCAACACCC CCTTCCTCAC GGGGATGCGC ATCCTGGACG TCCTCTTCCC 650 

CGTGGCCATG GGGGGCACCG . CCGCCATCCC TGGGCCCTTC GGCAGCGGCA 700 

AGACCGTGAC CCAGCAGTCC CTGGC CAAGT GGTCCAACGC CGACGTGGTG 750 

GTCTACGTGG GCTGCGGGGA GCGGGGGAAC GAGATGACCG ACGTGCTCGT 800 

GGAGTTCCCC GAGCTCACCG ACCCCAAGAC GGGTGGGCCC TTGATGCACC 850 

GCACCGTCCT CATCGCCAAC ACCTCCAACA TGCCCGTGGC CGCCCGCGAG 900 

GCCAGCATCT ACGTGGGCGT GACCATCGCC GAGTACTTCC GCGACCAGGG 950 

CTTCTCCGTG GCCCTCATGG CCGACTCCAC GAGCCGCTGG GCCGAGGCTT 1000 

TGCGCGAGAT CTCTAGCCGC CTCGAGGAGA TGCCCGCCGA GGAGGGCTAC 1050 

CCGCCCTACC TCGCCGCCAG GCTCGCCGCC TTCTACGAGC GGGCGGGCAA 110 0 

GGTCATCACC CTGGGCGGCG AGGAGGGGGC GGTGACCATC GTGGGGGCCG 1150 

TCTCCCCGCC GGGCGGCGAC ATGTCCGAGC CCGTGACCCA GTCCACCTTG 12 00 

AGGATCGTGG GGGCCTTCTG GCGGCTTGAC GCCTCCCTGG CCTTCCGCCG 1250 

CCACTTCCCC GCCATCAACT GGAACGGCTC CTACAGCCTC TTCACCTCCG 13 00 

CCCTTGACCC CTGGTACCGG GAGAACGTGG CCGAGGACTA CCCCGAGCTC 1350 

CGCGACGCCA TCTCCGAGCT TTTGCAGCGG GAGGCGGGCC TCCAGGAGAT 1400 

CGTCCAGCTC GTGGGGCCGG ACGCCCTCCA GGACGCCGAG CGCCTCGTCA 1450 

TTGAGGTGGG CCGGATCATC CGCGAGGACT TCCTGCAGCA GAACGCCTAC 1500 

CACGAGGTGG ACGCCTACTG CTCCATGAAG AAGGCCTACG GGATCATGAA 1550 

GATGATCCTC GCCTTCTACA AGGAGGCGGA GGCGGCCATC AAGCGGGGGG 1600 

TTTCCATAGA CGAGATCCTG CAGCTCCCCG TTCTGGAGCG CATCGGCCGC 1650 

GCCCGCTACG TGAGCGAGGA GGAGTTCCCC GCCTACTTTG AGGAGGCCAT 170 0 

GAAGGAGATC CAGGGGGCCT TCAAGGCTGG CCTAAAGGGG GAGAGATGGA 1750 

CCTTCTGA 1758 



2) INFORMATION FOR SEQ ID NO: 694 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694 
CGGCGCIATC YTSGTTGTTG C 
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2) INFORMATION FOR SEQ ID NO: 695 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695 
GTTTCACGTG ATGACGTACA 



2 ) INFORMATION FOR SEQ ID NO: 696 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 
ATIGGICAYR T I GAYCAYGG IAARAC 



2) INFORMATION FOR SEQ ID NO: 697 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 
CCIACIGTIC KICCRCCYTC RCG 



2) INFORMATION FOR SEQ ID NO: 698 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from J01690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 

5 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACTAC TCTGACCGCT GCAATCACCA 100 

CCGTACTGGC TAAAACCTAC GGCGGTGCTG CTCGTGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

10 CGTTGAATAC GACACCCCGA CCCGTCACTA CGCACACGTA GACTGCCCGG 250 

GGCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC 300 

GGCGCGATCC TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 4 00 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 4 50 

15 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 500 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 550 

AGTGGGAAGC GAAAATCCTG GAAC TGGCTG GCTTCCTGGA TTCTTATATT 600 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 700 

20 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 750 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 90 0 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 950 

25 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000 

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 110 0 

CGCGATGGAC GACGGTCTGC GTTTCGCAAT CCGTGAAGGC GGCCGTACCG 1150 

TTGGCGCGGG CGTTGTTGCT. AAAGTTCTGG GCTAA 1185 

30 

2) INFORMATION FOR SEQ ID NO: 699 

35' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
'(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 99 
45 GTIACIGGYT CYTYRARRTT ICCICC 26 



50 



2 ) INFORMATION FOR SEQ ID NO: 700 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 0 

60 
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24 



5 2) INFORMATION FOR SEQ ID NO : 701 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701 

GTGTTCACGA TCATCGATGC G 21 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 702 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702 
CTCTCGATAT CCGCGAAGCG 2 0 

2) INFORMATION FOR SEQ ID NO: 703 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703 
TATGGAAATT CGAAACATCT 2 0 

50 

2) INFORMATION FOR SEQ ID NO: 704 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



60 



400 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704 
5 AGTGCTCCAA TTAATGTTGG 20 



10 



2) INFORMATION FOR SEQ ID NO: 705 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705 

20 

GTACAGTTCC AATACCTGAA 2 0 



25 2) INFORMATION FOR SEQ ID NO: 706 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706 

TGAAATCTTC ACATCCAACA 20 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 707 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
45 <B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707 
TWACCATTTC AGTACCTTCT GGTAA 2 5 

2) INFORMATION FOR SEQ ID NO: 708 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 2 6 bases 

401 
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20 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708 
TCRTCCATIC CIARIATIGC I AT I AT 26 



2) INFORMATION FOR SEQ ID NO: 709 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia. burgdorferi 
25 (C) ACCESSION NUMBER: extracted from AE001122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709 

ATGAATGAAG TTTTATTTGT AAAGACTGCT GGTAGGAATT TAAAAGCAGA 50 

3 0 AGTAATTCGT ATTAGGGGCA ATGAAGTTGA TGCACAGGTT TTTGAATTGA 100 

CAAAAGGGAT ATCTGTTGGA GACCTAGTTG AATTTACAGA CAAACTTTTA 150 

ACAGTTGAAC TCGGACCAGG GCTTTTAACT CAAGTATATG ATGGGCTTCA 2 00 

AAATCCTTTG CCTGAATTGG CTATTCAATG TGGATTTTTT TTAGAAAGGG 250 

GAGTATATTT AAGGCCCTTG AATAAAGATA AAAAGTGGAA TTTTAAAAAA 3 00 

35 ACCTCCAAAG TTGGAGATAT CGTTATTGCA GGAGATTTTT TAGGTTTTGT 350 

AATTGAGGGA ACTGTTCACC ATCAAATAAT GATTCCATTT TATAAAAGGG 40 0 

ATTCTTATAA AATTGTGGAG ATTGTAAGTG ATGGCGACTA TTCGATTGAT 4 50 

GAGCAAATTG CTGTAATTGA AGATGATTCT GGTATGAGGC ATAATATTAC 500 

AATGTCTTTT CATTGGCCTG TTAAAGTTCC TATTACTAAT TATAAGGAAC 550 

40 GCCTTATTCC TAGTGAACCT ATGTTGACTC AAACTAGAAT TATAGATACA 600 

TTTTTCCCAG TTGCCAAAGG TGGAACTTTT TGCATTCCGG GTCCTTTTGG 650 

AGCAGGAAAA ACGGTTCTTC AGCAGGTTAC AAGTCGAAAT GCTGATGTTG 700 

ATGTAGTGAT TATTGCAGCT TGTGGTGAGC GAGCAGGAGA AGTGGTAGAA 750 

ACTCTTAAAG AATTTCCCGA ATTAATGGAT CCAAAAACCG GCAAATCTTT 800 

45 AATGGACAGG ACTTGTATTA TTTGTAATAC AT CTTCAATG CCAGTTGCAG 850 

CTAGAGAAGC TTCTGTTTAT ACTGCTATTA CTATTGGTGA GTATTACAGG 900 

CAAATGGGCC TTGATATTCT TCTTTTGGCA GATTCAACTT CAAGATGGGC 950 

TCAAGCAATG AGAGAAATGT CTGGACGCCT TGAGGAAATT CCTGGCGAGG 10 00 

AGGCTTTTCC GGCATATCTT GAGTCTGTTA TTGCTTCCTT TTATGAAAGG 1050 

50 GCAGGTATTG TAGTTCTTAA TAATGGGGAT ATTGGATCTG TAACAGTTGG 1100 

TGGCTCTGTA AGTCCTGCTG GTGGTAATTT TGAAGAGCCA GTTACTCAAG 1150 

CAACTTTAAA AGTTGTAGGA GCATTTCACG GGCTTACAAG AGAAAGGTCT 12 00 

GATGCTAGGA AATTTCCAGC TATTAGTCCT CTTGAATCTT GGAGTAAATA 1250 

TAAAGGCGTT ATTGATCAAA AAAAGACTGA ATATGCAAGA TCTTTTTTGG 13 00 

55 TGAAAGGTAA TGAAATTAAT CAAATGATGA AAGTTGTTGG AGAAGAAGGC 13 50 

ATAAGTAACG ATGATTTTTT AATTTATTTA AAATCCGAGC TACTTGATTC 1400 

GTGCTATTTG CAGCAAAATT CATTTGATTC TATTGATGCT GCTGTTAGTT 1450 

CAGAGCGTCA AAATTATATG TTTGATATAG TTTATAACAT TCTTAAAACT 1500 

AACTTTGAGT TTTCTGATAA ACTTCAAGCA AGAGATTTTA TAAATGAGTT 1550 

60 AAGGCAAAAT CTTTTAGACA TGAATCTTTC TTCTTTTAAG GATCATAAGT 1600 

402 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

TTAATAAATT GGAGCATGCT TTGGGTGAAT TGATAAATTT TAAAAAGbTA 1650 
ATTTAG 1656 



2 ) INFORMATION FOR SEQ ID NO: 710 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1818 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 
<B) STRAIN: Nichols 

(C) ACCESSION NUMBER: extracted from AE000520 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710 



GTGATCAAAG ACGATGTGGT TACAGGCCGT GTAGTGAGGG TGTCTGGTCC 50 

CATTGTGTAT GCCGAGGGCC TCTCTGCGTG CAGCGTATAC GATGTTGTCG 10 0 

25 ACGTAGGGGA AGCATCGCTC ATCGGAGAAA TTATCCGGTT GGATGAGAGC 150 

AAGGCGGTCG TGCAAGTATA CGAGGATGAC ACAGGTATGC GAGTCGGGGA 2 00 

GAAGGTGACA AGCTTGCGTC GACCACTCTC AGTCCGCTTA GGGCCTGGAT 2 50 

TAATCGGCAC CATTTATGAC GGTATTCAGC GCCCACTTGA GCGCCTCTTC 3 00 

CAAGAAGACG GCGCCTTCTT GCGTCCTGGT GCGCGTTCAC AACCGCTTGA 350 

30 TGGCTCCGTA CGCTGGGATT TTCGTCCTCA TTGTAACGAG CGCGGTGAGG 4 00 

CCCTGTGCGC GGGGATTCCG ATTGCAC CTG GGTCAGTGTT AGGGACCGTG 450 

CAGGAGACTC CTTCTGTTGT GCACACTATC ATGGTTCCTC CTGACATCCG 500 

GGGGAGCGTG CTATCTTCGT TCAAGGGCGC AGGTGCTTAC ACAATAGATG 550 

AAGAAATTGG ACGCACTGAT CTTGGTGAGC CGCTTTTTCT ATCCCAGTAC 60 0 

35 TGGCCAGTGC GTCGTGCGCG TCCTTTCAGC AAAAAACTTG CAGTGTGTGA 650 

GCCACTAGTT ACTGGACAGC GGGCGATTGA TGTTTTCTTC CCCCTATCAA 700 

AGGGAGGAAC GGCGGCTATT CCAGGGGGAT TTGGAACTGG GAAGACAATG 750 

ACGCAGCATG CCGTTGCCAA GTGGTGTGAT GCAGATATTA TCGTGTACAT 800 

CGGCTGCGGA GAGCGGGGCA ACGAGATGAC AGACGTGCTC TCTGAATTTC 850 

40 CCAAACTCAT CGATCCGCGC ACAGGACGCT CTCTTATGGA GCGGACGATT 900 

TTGATCGCAA ATACGTCCAA TATGCCTGTG TCCGCACGCG AGGTGTCGCT 950 

GTATTCAGGG ATTACCCTTG CGGAATACTA CCGTGATATG GGTATGCATG 1000 

TGGC CATCAT GGCTGATTCT ACCAGCCGCT GGGCGGAGGC GCTGCGTGAA 1050 

TTGTCTGGGC GCATGGAAGA AATGCCTGCG GAGGAGGGAT TCCCTGCGTA 1100 

45 CCTTCCGACG CGTCTTGCAG AATTTTATGA GCGCGCAGGA CGCGTGGAAA 1150 

CCTGTGTGGC GCGCGAGGGC TCTGTGAGCA TCATTGGTGC TGTTTCTCCC 1200 

CTGGGTGGAG ATTTCTCTGA GCCGGTGACG CAGCACACAA AGCGCTTCAT 1250 

CCGTTGCTTT TGGGCCTTGG ATCGTGAACT TGCACACGCG CGTCATTACC 1300 

CTGCCATTGG GTGGATAGAT TCATACTCTG AATATGCGCA GGAAGTAAGT 13 50 

50 GCATGGTGGA GTAAGTATGA CCCGCGCGCA GGCGCGTTGC GCGCCGCAGC 1400 

CTTGGATTTG CTGAGAAAGG AACAGCGGTT ACAGCAAATT GTCAGGCTTG 1450 

TCGGTCCTGA TGCGCTGCCT GGAGAAGATC GTCTGGTGCT AATGGTGTGT 1500 

GAAATGATCA AAGGTGGCTT TCTGCAGCAG AACGCTTTTG ATCCGACGGA 1550 

TGTGTTCTCC TGTCCCGAAA AGCAGGTGCA GATCTTGCGT ACCATAGTGG 1600 

55 ATTTTCACGA ACGTGCCGTG GTGCTGCTGC GTGCAGGTAT TTCGCTTTCT 1650 

GCGCTGTCCC AGCTTTCGTG CCGGGAGCTC ATCGTACGTA TGAAAACTAC 1700 

GTACGGGAAT GAGGATGTAC ACAAGATGCA GAAAGTGTAC GACACGATGT 1750 

GCACTGAGTT TGACCAACTG AGTGTGTGTG CTGCCGCGCG CACACAAGGG 1800 

GGGGAGAAAG TCGAATGA 1818 
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2) INFORMATION FOR SEQ ID NO: 711 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 
15 (B) STRAIN: MoPn 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711 



CAGGGCTATG TCGTAGAAGC TTACGGAAAT TTATTGCGGG TGCATTTTGA 50 

20 TGGGCATGTG CGTCAAGGAG AAGTGGCCTA TGTCAGCGTG GATGATACTT 100 

GGTTGAAAGC GGAAATTATA GAAGTTGTGG GAGATGAGGT TAAAGTCCAA 150 

GTTTTTGAGG AAACTCAAGG AATTTCTCGA GGCGCTTTGG TAACTTTTTC 200 

CGGGCATTTA TTAGAAGCGG AACTTGGGCC CGGTCTATTG CAAGGTATTT 250 

TTGACGGACT TCAGAATCGC TTAGAGGTAT TGGCAGATAC AAGCTTGTTC 300 

2 5 TTGAAAAGAG GGGAGTATGT TAATGCCATT TGTCGGGAAA CTGTATGGGC 35 0 

TTATACGCAA AAGGCTTCTG TCGGGGATGT TCTATCTCGG GGAGATGTGC 400 

TTGGTACAGT AAAGGAAGGG CGGTTTGATC ATAAAATCAT GGTTCCTTTC 450 

TCTTGTTTTG AGGAAGTGAC TATCACTTGG GTCATTTCTT CAGGAGATTA 500 

CACTGTTGAT ACCGTTATTG CTAAAGGACG TACTGCTTCA GGAGCCGAGC 55 0 

30 TTGAATTTAC AATGGTTCAG AAATGGCCCA TTAAACAGGC TTTTTTAGAA 600 

GGGGAAAAGG TACCGTCTCA TGAAATTATG GATGTTGGGT TACGAGTATT 650 

AGATACTCAG ATCCCCGTCT TAAAGGGAGG AACTTTTTGT ACTCCAGGGC 700 

CTTTTGGTGC AGGAAAGACC GTTTTACAGC ACCATTTATC TAAGTATGCA 750 

GCTGTAGATA TCGTAGTTTT GTGTGCTTGT GGAGAGCGAG CTGGAGAGGT 800 

35 TGTAGAAATT CTTCAGGAGT TCCCGCATTT GACAGATCCT CATACGGGGC 850 

AGTCTTTGAT GCATAGGACC TGTATTATTT GTAATACATC TTCCATGCCT 900 

GTAGCAGCTA GAGAGTCCTC CATTTATTTG GGTATTACTA TAGCAGAATA 95 0 

TTACCGTCAA ATGGGGTTGC ATGTTTTGTT ATTGGCTGAC TCGACATCTA 1000 

GATGGGCTCA AGCTTTAAGG GAAATTTCAG GGCGATTAGA AGAAATCCCT 1050 

4 0 GGAGAAGAAG CTTTCCCAGC CTATTTGGCG TCTCGAATAG CAGCTTTTTA 1100 

TGAGCGAGGC GGGGCTGTGA AAATGAAAGA TGGATCGGAA GGCTCCTTGA 1150 

CTATCTGTGG AGCGGTTTCT CCCGCAGGAG GAAATTTTGA AGAGCCTGTT 120 0 

ACACAAGCAA CTTTATCTGT TGTTGGGGCT TTCTGTGGGC TTTCTAAGGC 1250 

TAGAGCAGAT GCTAGACGGT ATCCTTCTAT TGATCCGATG ATTTCATGGT 1300 

45 CTAAGTACTT GGATTCTGTG GCGGAGATTT TAGAGAAAAA AGTTCCAGGA 1350 

TGGGGAGATT CCGTTAAAAA AGCTTCTCGT TTCTTAGAAG AAGGAGCAGA 1400 

AATTGGTAAG CGAATAGAAG TTGTTGGGGA AGAAGGGATT TCTATGGAAG 1450 

ATATAGAAAT CTTTTTGAAA TCAGAGTTGT ATGATTTCTG TTACTTACAG 1500 

CAAAACGCTT TCGATGCAGA GGACTGTTAT TGTCCTTTTG ATCGTCAAAT 1550 

50 AGAGCTTTTT TCTTTAATGA GTCATATTTT TAGCTCTAGA TTCTGTTTTG 1600 

ATTGTCCAGA TAATGCTCGG AGTTTCTTTT TAGAGCTTCA AAGTAAAATT 1650 

AAAACGCTGA ATGGTCAAAA ATTCCTTTCT GAAGACTATC AGAAGGGGCT 1700 

AGAAGTGATC TATAAACTAT TAGAAAGCAA AATGGTGCAG ACGGCGTAGG 1750 

TATGCAAACA ATATATACAA GAATTACGG 1779 

55 



2 ) INFORMATION FOR SEQ ID NO: 712 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 965 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712 

GTGCAAATTG GAAAAATTGT CAAAGTTTCA GGTCCTTTGA TTTTAGCTGA 50 

AAACATGTCA GATGCTAGTA TCCAAGACAT TTGTCATGTA GGAGATTTAG 100 

GCGTTATCGG AGAGATTATT GAAATGCGAG GCGACGTCGC TTCGATTCAA 150 

GTATATGAAG AAACAACAGG CATTGGACCA GGAGAACCAG TTATTTCAAC 200 

AGGAGAACCA TTATCTGTTG AATTAGCCCC AGGTTTAATT GCCGAAATGT 250 

TTGATGGTAT TCAACGACCA TTGGATACAT TTCAAGAAGT AACCCACAGT 3 00 

AACTTTTTAG GCCGTGGCGT TAAAATTGAT GCGTTAGATC GTGAGAAAAA 350 

ATGGACGTTT GAACCAACTG TGGCAGTTGG TGAAGAAGTG TCGGCAGGTG 4 00 

ACATCGTCGG TGTGGTTCAA GAAACACCGA TTATTCAACA TAAAATTATG 450 

GTGCCTTTCG GCGTTTCAGG AACGATTGCC GAAATTAAAG CAGGTGACTT 500 

TGCCATTGAT GAAACAGTTT ACTCAGTGGA AACGGCTAAA GGAACGGAAA 550 

GTTTTAGCAT GATGCAAAAA TGGCCCGTTC GGCGGGGACG TCCCATTTTA 600 

GAAAAACTAA GTCCCAAAGT AC CGATGGTG ACCGGACAAC GCGTAATTGA 650 

TACCTTTTTC CCAATTACGA AAGGCGGAGC GGCAGCAGTT CCAGGACCAT 700 

TTGGCGCTGG AAAAACAGTC GTTCAGCACC AAATTGCTAA GTGGGCCGAT 750 

GTCGACTTAG TCGTTTACGT TGGTTGTGGG GAACGCGGGA ATGAAATGAC 80 0 

AGATGTTTTA AATGAATTTC CAGAATTAAT TGACCCAACA ACTGGTGAGT 850 

CTTTGATGAA TCGGACGATT TTAATTGCGA ATACGTCAAA TATGCCGGTA 90 0 

GCGGCACGGG AAGCCTCGAT TTATACAGGG ATTACCATTG CAGAATATTT 950 

CCGTGATATG GGTTA 965 



2) INFORMATION FOR SEQ ID NO: 713 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1737 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Methanosarcina barker! 

(C) ACCESSION NUMBER: extracted from J04836 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713 

GTGGAAGTAA AAGGTGAAAT TTATCGTGTG TCTGGGCCTG TCGTCACCGC 50 

CATCGGCTTG CAGGCAAAAA TGTATGACCT GGTCAAAGTC GGTAATGAAG 100 

GTTTAATGGG TGAAGTCATT CAGATATTAG GGCCCAAGAC CATCATCCAG 150 

GTATATGAAG AGACCGCAGG TATCAAGCCA GGGGAACCCT GTGTATCTAC 2 00 

AGGGTCGTCT CTGTCCGTAG AACTTGGTCC GGGTCTTCTT TCCAGTATTT 250 

ATGACGGGGT TCAAAGGCCT CTGCACGTCC TGCTTGAAAA AATGGGTAGC 3 00 

TTCATCCAGA GAGGTGTCAG CGCAGATGGG CTTGATCATA AGAAACTCTG 350 

GGATTTCAAA CCCATTGTCA AGAAGGGCGA TTCCGTAAAA GGTGGAGACG 400 
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TAATTGGTGT TGTACAGGAA ACCGTGAATA TTGAACATAA GATC&TGGTG ' ' ~45fc> 

CCTCCTGATA TCTCAGGTAC AATTTCCGAC ATAAAGAGCG GAAACTTTAC 5l)0 

GGTAGTAGAC ACAATCTGTA CTCTGACTGA TGGGACCGAA TTGCAGATGA 550 

TGCAGAGGTG GCCTGTTCGA AGACCCAGAC CTGTGAAGGC AAAACTTACT 600 

5 CCAACCAGGC CTCTGGTTAC AGGAATGAGA ATCCTTGATG GGCTTTTCCC 650 

TGTGGCAAAA GGCGGAACAG CTGCAATCCC CGGACCTTTC GGATCGGGAA 700 

AGACCGTAAC TCAGCAGTCG CTTGCAAAAT GGAGTGATAC CGAAATTGTG 750 

GTCTACATCG GTTGTGGTGA GCGTGGAAAC GAAATGGCAG ATGTTCTGAG 8 00 

CGAATTCCCT GAACTCGAAG ATCCGCAGAC CGGGCGCCCA CTTATGGAGC 850 

10 GTACTGTTCT TATCGCTAAC ACTTCAAACA TGCCTGTGGC CGCAAGAGAA 900 

GCATCTGTGT ATACCGGAAT CACCATTGCA GAATACTACC GTGACATGGG 950 

ATTAGATGTA TCCCTTATGG CAGACTCCAC CTCAAGGTGG GCAGAAGCCA 1000 

TGAGAGAAAT CTCTTCCCGT CTGGAAGAAA TGCCTGGTGA AGAAGGTTAC 1050 

CCAGCATACC TGTCTGCAAG ACTGGCCGAA TTCTACGAGC GTGCCGGGGT 110 0 

15 TGCGGAGAGT CTTTGCGGCG AAACAGGTTC CATTACTGTT ATTGGAGCAG 1150 

TATCTCCACC TGGCGGTGAC TTCTCAGAGC CTGTTACACA GAATACCCTG 1200 

CGTATCGTAA AAGTGTTCTG GGCTCTCGAT GCCAAACTAT CTCAGAGGCG 1250 

TCACTTCCCG GCCATCAACT GGCTGAACAG TTACAGTCTG TATAAGGACA 1300 

GTCTTAATGA CTGGTTTGCA GATAATGTGG CTCCTGATTA TGTGCCTTTG 1350 

20 AGGGAAAGAG CAATGGAAAT GCTCCAGACA GAATCTGAAC TGCAGGAAAT 1400 

CGTGCAGCTT GTAGGTTCCG ATGCTCTGCC AGACGACCAG CAGCTT CTGC 1450 

TTGAAATCAC CCGTATGCTT AGGGAAATTT TCCTGCAGCA GAATGCATTC 150 0 

CACCCAGTAG ATGCATACAG CCCGTTCGAT CAGCAGTACA AGATCCTTAA 1550 

GGCAATCATG AAATGGGGAG ACGCTGCGAT GGATGCCTTG AAATCAGGTG 1600 

25 TTCCCGTAAC TGAAATTATC AAGCTTGAAT CCAAAAATGT GCTTGCTAAG 1650 

GTCAAGTACG AAGAGAAGTT TGATGAGTCT ATGAATGCTG TCCTGGCACA 1700 

GATGGATAAA GAGTTTGCAT CCCTGAGAGG TAGGTAA 1737 



30 



40 



2) INFORMATION FOR SEQ ID NO: 714 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1785 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina jaimaschii 

(C) ACCESSION NUMBER: extracted from U674 77 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714 

ATGAAAAGAG AGGTTGAGAA TATGCCAGTT GTTGGTAAGA TTATTAAAAT 50 

CGCAGGGCCT GTTGTAGTTG CAGAGGGAAT GAAAGGAGCT CAGATGTATG 100 

AGGTCGTTAA AGTAGGAGAA GAGAAATTGA CTGGAGAAAT CATTCAGTTG 150 

50 CACGATGATA AAGCAGTTAT TCAGGTTTAT GAAGAAACAT CTGGAATTAA 200 

ACCAGGAGAG CCAGTTGTTG GTACTGGAGC TCCATTGTCT GTTGAATTAG 250 

GGCCAGGGAT GTTAAGAGCT ATGTATGATG GTATTCAGAG GCCTTTAACA 300 

GCAATTGAAG AGAAAACAGG TTCAATCTTT ATCCCAAGAG GAGTTGATGT 350 

CCCTGCATTA CCAAGAGATA TAAAATGGGA ATTTAAACCA GTGGTAAATG 400 

55 AAGGAGATTA TGTTGAAGAA GGAGACATAA TTGGAACTGT TGATGAAACT 450 

CCTTCAATAG TTCATAAAAT CTTAGTTCCA ATTGGTGTTA AAGGAAAAAT 500 

TGTTGAAATA AAAGAGGGTA AATTTACAGT TGAAGAGACA GTTGCAGTTG 550 

TAGAAACAGA AAATGGAGAA AGGAAAGAAA TTACAATGAT GCAAAAATGG 600 

CCAGTAAGAA AACCAAGACC ATATAAAGAG AAACTACCTC CAGAAATTCC 650 

60 ATTAATTACA GGGCAAAGAG TTGAAGACAC TTTCTTTACA TTAGCAAAAG 700 
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GAGGAACAGC 
CAGCATCAGT 
ATGTGGAGAA 
ACTTAGAAGA 
5 ATAGCCAACA 
TACAGGAATT 
TATTAACAGC 
TCAGGTAGAT 
AGCTTCAAGA 

10 TAGGGAAAGA 
CCACCAGGAG 
AGTTAAGGTA 
TCCCAGCTAT 
ACAGAGTGGT 

15 TGAAGCAATG 
AGTTAGTTGG 
GTTGCAAGAA 
GGTAGATACC 
TTATGACATT 

20 CCAGCTAAGA 
ATACATCCCA 
AGAAAATTAA 



AGCAATTCCA 
TGGCAAAGTG 
AGAGGAAACG 
TATTAGAACT 
CATCAAACAT 
ACAATTGCAG 
AGATTCAACA 
TGGAAGAAAT 
TTGGCTCAGT 
TAACAGACAA 
GGGACTTCTC 
TTCTGGGCGT 
CAACTGGTTG 
GGAACACAAA 
AGCTTATTAC 
GCCTGATGCA 
TGTTGAGGGA 
TACTGTCCTC 
CTACCAAGAA 
TTTTAGGAGT 
CACGATGAGT 
GAATGAATTA 



GGTCCATTCG 
GTCTGACGCT 
AGATGACAGA 
GGAAACAAAT 
GCCTGTCGCT 
AGTACTTCAG 
TCAAGATGGG 
GCCAGGGGAA 
TCTATGAAAG 
GGATTCGTTT 
AGAACCAGTT 
TAGATGCAAA 
CAGAGTTATT 
TACTGGTCCA 
AAAAAGAGGC 
TTGCCAGATA 
GGATTTCTTA 
CAATGAAACA 
GCATTGAAGG 
TTCAGTTAAG 
TTATAAATGT 
GGTTCATTAA 



GTTCAGGAAA 
GATGTCGTTG 
GGTTATTGAA 
TAATGGATAG 
GCAAGGGAAG 
AGATATGGGT 
CAGAGGCAAT 
GAAGGGTATC 
AGCTGGAAGA 
GTATCGTTGG 
ACATCAAACA 
CTTGGCAAGA 
CATTATACAT 
GATTGGAGAC 
AGAGTTGCAA 
GGGAGAGAGT 
CAGCAAGATG 
GTACTTAATG 
CAGTTGAAAG 
CAAGATATTG 
TAAATCAAAA 
ACTAA 



AACGGTTACT "750 

TTTATATCGG 8 00 

GAGTTCCCAC 850 

AACTGTATTA 900 

CATCTGTCTA 950 

TATGGAGTTT 1000 

GAGAGAAATT 1050 

CAGCATACTT 1100 

GTTATAACCT 1150 

AGCTGTTTCA 12 00 

CACTAAGGAT 12 50 

AGAAGACACT 13 00 

TGATGATGTT 13 50 

AATTAAGAGA 1400 

GAGATTGTTC 1450 

TATTTTAGAA 1500 

CGTTTGATGA 1550 

TTAAAGATAA 1600 

AGGAGTTGAA 1650 

CAAGAATGAA 1700 

GAAAT AATGG 1750 
1785 



25 



2) INFORMATION FOR SEQ ID NO: 715 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1354 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(B) STRAIN: W83 

40' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715 

TGAGCGAGGT GATCAAGGTG ATCGGCAAAA ATGCTTATGT GCAGGTTTTC 50 

GAAAGTACTC GCGGTATGCA CGTAGGAGAT GAGGCAGAGT TTACCGGCAG 10 0 

TATGCTTGAG GTAACGCTCG GCCCCGGTAT GCTTTCGAAG AACTACGACG 150 

45 GTCTGCAACA CGACTTGGAC AAGATGGACG GGATCTTCCT CAAACGAGGC 200 

GATTATACTC CCGCTCTCGA TGACGACAAG CTGTGGGACT TCAAGCCTTT 250 

GGCCAATGTG AACGACAATG TGATCGCAGG CTCATGGCTC GGAGAGGTGA 3 00 

CGGAAAATTT CCAACCGCAC AAGATCATGG TACCTTTCGT TTTCGAAGGC 3 50 

AATTACAAGG TGAAGAGTCT GGCCAAAGCC GGTTCGTACA AAGTGAACGA 400 

50 TGTGATCGCT GTGGTAACGG ATCAGGACGG GAAAGACCAC AATGTAACCA 450 

TGGTGCAGAA ATGGCCGGTG AAACGTGCTA TCACTTGCTA TCGCGAGAAG 500 

CCGCGTCCTT TCAAACTGCT CGAAACGGGT ATCCGTATCA TCGACACTTT 550 

CAACCCCATC GTAGAGGGTG GTACGGGATT TATCCCCGGT CCTTTCGGTA 600 

CGGGAAAGAC GGTGCTCCAG CATGCTATCT CGAAGCAGGC GGAAGCCGAT 650 

55 ATCGTGATCA TTGCAGCCTG TGGCGAGCGT GCAAACGAGG TTGTGGAGAT 700 

CTTTGCGGAA TTCCCCCACC TGAATGACCC CCACACGGGA CGTAAATTGA 750 

TGGAACGTAC CATTATTATT GCTAATACGT CGAATATGCC TGTGGCTTCG 800 

CGTGAGGCAT CCGTATATAC GGCCATGACG ATAGCCGAGT ACTATCGCTC 850 

CATGGGCCTT CGCGTGCTGA TGATGGCAGA CTCCACTTCG CGTTGGGCAC 900 

60 AGGCTCTGCG TGAGATGTCT AACCGTCTGG AAGAGCTTCC CGGACCGGAT 950 

407 
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GCTTTCCCGA TGGACTTGTC AGCTATCGTA 
AGGATACGTT TACCTGAACA ACGGTTCGGC 
GTACGGTATC TCCCGCCGGT GGTAACCTCA 
ACCAAGAAAG TGGCTCGCTG CTTCTATGCT 
5 CCGCAAACGT TATCCGGCTG TAAACCCCAT 
TCGAATATCC CGAATTCGAG AGCTATATAT 
ATTATATTCA ACAGAAGTTG GCAGATATAT 
CTGGACTACT AAGGTGAATG AGCTGAAGAT 
AAAT 
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GCCAACTTCT ACG^TCGTGC 1/00 

CGGTTCGGTA ACGTTCATCG 1050 

AAGAGCCTGT GACGGAAAAC 1100 

TTGGAGCAGA ATCGTGCCGA 1150 

CGATAGTTAC TCGAAGTACA 1200 

CGAACCACAT CAGTTTACTC 1250 

GGATCAAATC TTACAGCAGA 1300 

GCGCTTGCAT CAGGGTAAAG 1350 

1354 



2 ) INFORMATION FOR SEQ ID NO: 716 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
25 (B) STRAIN: Type 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716 

TTTGACTCAA GGGAAGATTA T AAAAGT AT C GGGACCTCTA GTTATTGCAT 50 

30 CAGGTATGCA GGAGGCTAAT ATTCAAGATA TTTGCCGTGT AGGTAAGCTA 100 

GGGTTAATCG GTGAAATTAT TGAAATGAGA AGAGATCAGG CATCTATCCA 150 

AGTCTATGAA GAAACATCTG GTCTTGGTCC GGGAGAACCT - GTTGTTACAA 2 00 

CTGGAGAACC TCTCTCGGTT GAATTAGGGC CAGGATTGAT TTCTCAAATG 250 

TTTGATGGCA TACAACGCCC ATTAGATCGA TTTAAATTGG CTACTCATAA 3 00 

35 TGATTTTCTA GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT AGAGATATTA 3 50 

AGTGGCATTT TGATTCCACT ATAGCAATTG GTCAAAAAGT GAGTACGGGT 4 00 

GATATTCTTG GAACTGTCAA GGAAACCGAG GTAGTTAATC ATAAAATTAT 450 

GGTTCCTTAT GGAGTATCTG GAGAAGTCGT TTCTATTGCA TCTGGCGATT 5 00 

TTACAATTGA TGAAGTTGTA TATGAAATAA AAAAATTGGA CGGTAGTTTC 550 

4 0 TATAAAGGAA CGCTTATGCA AAAATGGCCT GTCCGCAAGG CGCGTCCTGT 600 

TTCTAAACGT TTAATTCCAG AAGAACCATT AATCACAGGT CAACGAGTTA 650 

TTGATGCATT CTTTCCAGTA ACCAAAGGGG GAGCTGCAGC AGTTCCTGGA 700 

CCGTTTGGAG CAGGAAAGAC AGTTGTACAA CACCAAGTAG CTAAATTTGC 750 

CAATGTTGAT ATTGTTATTT ATGTCGGTTG TGGAGAACGT GGAAATGAAA 800 

45 TGACGGATGT ACTGAATGAG TTTCCTGAGT TGATTGACCC TAATACCGGA 850 

CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT CAAATATGCC 900 

TGTTGCTGCT CGTGAGGCTT CAATTTATAC AGGAATTACC ATGGCTGAGT 950 

ATTTTCGTGA TATGGGCTAC TCTGTCGCCA TTATGGCTGA TTCAACTTCA 1000 

CGTTGGGCAG AAGCGCTACG TGAAATGTCA GGACGTCTAG AAGAAATGCC 1050 

50 TGGTGATGAG GGTTATCCTG CTTATCTGGG AAGTCGTATC GCTGAATATT 1100 

ATGAAAGAGC AGGACGTTCT CAGGTTCTAG GGCTTCCAGA ACGTGAAGGA 1150 

ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA 1200 

ACCAGTTACT CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG 1250 

ATGCTCCGTT GGCACAGCGA CGTCATTTTC CTGCAATTAA CTGGCTTACA 13 00 

55 TCTTATTCAC TATATAAAGA CAGTGTGGGC ACTTATATAG ATGGTAAAGA 1350 

GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC TACTTACAAC 1400 

GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 1450 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA 1500 

TTATTTGCAA CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG 1550 

60 CAAAACAAGA AGCAATGCTA AGTAATATTC TCACTTTTGC TGATCAGGCA 1600 

408 
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AATCATGCTT TAGAGTTGGG TTCTTACTTT 
CGTGGCAGTT CGAGACCGTA TGGCGAGAAG 
GATTAGATGA AATCAAAATT ATATCAAATG 
TTGATATTAG AAACAGGAGG TCTATAAATG 



ACAGAGATTA TGC2&AGGTAC L650 

TAAATATGTT TCAGAAGATA 1700 

AGATTACACA TCAAATTCAT 1750 

AGTGTTAT 1788 



2) INFORMATION FOR SEQ ID NO: 717 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717 

CGATCCTGGT GTGCTCGGCC GCTGACGGCC CGATGCCGCA AACGCGTGAG 50 

CACATCCTGC TGGCGCGTCA GGTCGGTGTG CCGTACATCA TCGTGTTCCT 100 

GAACAAGTGC GACATGGTGG ACGACGCGGA GCTGCTCGAG CTGGTCGAAA 150 

TGGAAGTGCG CGAACTGCTG TCGAAGTACG ACTTCCCGGG CGACGACACG 2 00 

CCGATCATCA AGGGTTCGGC GAAGCTGGCG CTGGAAGGCG ACAAGGGCGA 250 

GCTGGGCGAA GTGGCGATCA TGAACCTGGC CGACGCGCTG GACACGTACA 3 00 

TCCCGACGCC GGAGCGTGCG GTCGACGGCG CGTTCCTGAT GCCGGTGGAA 350 

GACGTGTTCT CGATCTCGGG CCGTGGTACG GTGGTGACGG GTCGTGTCGA 4 00 

GCGCGGCGTG ATCAAGGTTG GCGAGGAAAT CGAAATCGTC GGTATCAAGG 450 

CGACGGCGAA GACGACCTGC ACGGGCGTGG AAATGTTCCG CAAGCTGCTG 500 

GACCAGGGTC AGGCGGGCGA CAACGTCGGT ATCCTGCTGC GCGGCACAAA 550 

GCGTGAAGAC GTGGAGCGCG GCCAGGTTCT GGCGAAGCCG GGTTCGATCA 600 

CGCCGCACAC GCACTTCACG GCAGAAGTGT ACGTGCTGAG CAAGGACGAA 650 

GGCGGCCGCC ACACGCCGTT CTTCAACAAC TACCGTCCGC AGTTCTACTT 700 

CCGTACGACG GACGTGACGG GCTCGATCGA GCTGCCGAAG GACAAGGAAA 750 

TGGTGATGCC GGGCGACAAC GTGTCGATCA CGGTGAAGCT GATCGCGCCG 800 

ATCGCGATGG AAGAAGGTCT GCG 823 



2 ) INFORMATION FOR SEQ ID NO: 718 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718 

GCGATCCTGG TGTGCTCGGC CGCTGACGGC CCGATGCCGC AAACGCGTGA 50 

409 
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GCACATCCTG CTGGCGCGTC AGGTCGGTGT GCCGTACATC ATCSGTGTTCC /LOO 

TGAACAAGTG CGACATGGTG GACGACGCGG AGCTGCTCGA GCTGGTCGAA 150 

ATGGAAGTGC GCGAACTGCT GTCGAAGTAC GACTTCC CGG GCGACGACAC 200 

GCCGATCATC AAGGGTTCGG CGAAGCTGGC GCTGGAAGGC GACAAGGGCG 250 

5 AGCTGGGCGA AGTGGCGATC ATGAACCTGG CCGACGCGCT GGACACGTAC 300 

ATCCCGACGC CGGAGCGTGC GGTCGATGGC GCGTTCCTGA TGCCGGTGGA 350 

AGACGTGTTC TCGATCTCGG GCCGTGGTAC GGTGGTGACG GGTCGTGTCG 400 

AGCGCGGCGT GATCAAGGTT GGCGAGGAAA TCGAAATCGT CGGTATCAAG 450 

GCGACGGCGA AGACGACCTG CACGGGCGTG GAAATGTTCC GCAAGCTGCT 500 

10 GGATCAGGGT CAGGCGGGCG ACAACGTCGG TATCCTGCTG CGCGGCACGA 550 

AGCGTGAAGA CGTGGAGCGC GGCCAGGTTC TGGCGAAGCC GGGTTCGATC 600 

ACGCCGCACA CGCACTTCAC GGCTGAAGTG TACGTGCTGA GCAAGGACGA 650 

AGGCGGCCGC CACACGCCGT TCTTCAACAA CTACCGTCCG CAGTTCTACT 700 

TCCGTACGAC GGACGTGACG GGCTCGATCG AGCTGC CGAA GGACAAGGAA 750 

15 ATGGTGATGC CGGGCGACAA CGTGTCGATC ACGGTGAAGC TGATCGCGCC 800 

GATCGCGATG GAAGAAGGTC TGCG 824 



20 2) INFORMATION FOR SEQ ID NO: 719 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 0 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium beijerincki (deposited as 
Clostridium hutyricum) 

(B) STRAIN: ATCC 8260 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719 

TGTATCAGCA GCAGATGGTC CAATGCCACA AACAAGAGAA CATATACTAC 50 

TAGGATCAAG AGTTGGTATC CAATATATCG TAGTATTCTT AAATAAAGCA 100 

GATATGGTAG ACGATCCAGA ATTATTAGAA TTAGTAGAAA TGGAAGTAAG 150 

40 AGAATTATTA AGCGAATATG ACTTCCCAGG AGACGATATT CCAGTAATAA 200 

CAGGATCAGC ATTAAAAGCA TTAGAAAATC CAACAGATGA AGAAGCAATT 250 

AAGCCAATCA TGGATTTAAT GGAAGCAGTA GATAGCTATA TCCCAACTCC 3 00 

AGAAAGAGCA ACAGATAAGC CATTCTTAAT GCCAATCGAA GATGTATTCA 3 50 

CAATTACAGG AAGAGGAACA GTTGCAACAG GAAGAGTTGA AGCTGGAGTA 400 

45 CTTCATGTAG GAGATGAAGT AGAAATCGTT GGATTAACAG AAGAAAAGAA 450 

GAAAGTTGTA GTAACTGGAA TCGAAATGTT CAGAAAGTTA TTGGATGAAG 500 

CACAAGCTGG AGATAACATC GGAGCATTAT TAAGAGGAGT TCAAAGAACT 550 

GATATTGAAA GAGGTCAAGT TTTATCAAAA CCAAATTCAG TACACCCTCA 600 

CACTAAATTT GTAGGTCAAG TATACGTACT TAAAAAAGAA GAAGGTGGAA 650 

50 GACATACTCC ATTCTTTGAT GGATACAGAC CACAATTCTA TTTCAGAACA 700 

ACAGACGTTA CAGGRTCAAT CAAGTTACCA GATGGAATGG AAATGGTAAT 750 

GCCTGGAGAT CACATTGATA TGAATGTTGA ATTAATCACT CCAATCGCAA 800 



55 

2) INFORMATION FOR SEQ ID NO: 720 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 799 bases 
60 (B) TYPE: Nucleic acid 

410 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720 

GGTGCTATCC TGGTTGTTGC TGCATCTGAT GGTCCTATGC CTCAGACTCG 50 

TGAGCACATC CTGCTTGCTC GTCAGGTAGG TGTTCCTTAC ATCGTTGTAT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAACTGAT CGACCTTGTT 150 

15 GAAATGGAAG TACGTGAGCT GTTAAGCGAG TACGGATTCG ACGGAGATAA 200 

CGCTCCGGTT ATCCGTGGTT CTGCACTGAA GGCTCTGGAA GGTGACGACA 250 

AATACGTTGG CGCTATCAAA GAACTGATGG ATGCAGTTGA TGAATTCATC 300 

CCAGATCCAA CTCGTGAAAC TGACAAACCA TTCCTGATGT CTGTAGAAGA 350 

CGTTATGACA ATCACAGGAC GTGGTACAGT TGCTACAGGA CGTGTTGAGC 400 

20 GTGGGGTAGT AAAACTGGGA GAAGAAGTTG AAATCGTTGG TATCAAGGAT 450 

ACTCAGAAAA CTGTTGTTAC CGGACTGGAA ATGTTCCGTA AGCAGCTGGA 500 

CTTCGCAGAA TCCGGAGACA ACATCGGTGC TCTGCTGCGT GGTATCAACC 550 

GTGACCAGAT TCAGCGTGGA CAGGTTCTTG CTAAACCAGG ATCCGTACAT 600 

CCACACACAA AGTTCAAGGC TCAGGTTTAT GTATTAACAA AAGAAGAAGG 650 

25 TGGACGTCAC ACTCCATTCG TTTCTAACTA CCGTCCTCAG TTCTACTTCC 70 0 

GTACAACTGA CGTAACTGGT GTTATTACAT TACCGGAAGG AACTGAAATG 750 

GTTATGCCTG GTGACAACGT TGAAATGAAC GTTGAGCTGA TTGCTCCAA 799 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 721 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721 



50 



55 



60 



TTGTATCAGC 
TTAGCATCAA 
AGACCAAGTA 
GAGAATTATT 
GTAGGATCAG 
AGACTTAATG 
CAGATCAACC 
AGAGGAACAG 
AGATGAAGTA 
TCACAGGAGT 
GATAACATCG 
AGGTCAAGTA 
TAGGTCAAGT 
TTCTTTAACG 



AGCAGATGGT 
GAGTAGGAGT 
GATGATCCAG 
AAGCGAATAC 
CATTAAAAGC 
AAAGCTGTAG 
ATTCTTAATG 
TTGCAACAGG 
CAAATCGTAG 
AGAAATGTTC 
GAGCATTATT 
CTAGCAAAAC 
TTACGTATTA 
GATACAGACC 



CCAATGCCAC AAACAAGAGA 
TAACCACATA GTAGTATTCT 
AATTACTAGA ATTAGTAGAA 
GGATTTGACG GAGACGAATG 
AATCGAAGAA GGGGATGACC 
ATGAATATAT CCCAACTCCA 
CCTGTAGAAG ATGTATTTAC 
AAGAGTTGAA AGAGGAGTAC 
GAATGAAAGA AGAAATCGGA 
AGAAAGATGT TAGATGAAGC 
AAGAGGAGTA CAAAGAGACG 
CAGGTTCAGT AACACCTCAC 
AAGAAAGAAG AAGGTGGAAG 
ACAATTCTAC TTCAGAACAA 
411 



ACACATCCTA 50 

TAAACAAAGC 100 

ATGGAAGTAA 150 

TCCAGTAGTA 200 

AATGCATCCT 250 

GAAAGAGCAA 3 00 

AATTACAGGA 350 

TACACGTAGG 4 00 

AAGACAACAA 450 

AATGGCTGGA 500 

AAATCGAAAG 550 

AAAAAATTCG 600 

ACACACTCCA 650 

CAGACGTAAC 700 
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AGGATCAATC GCTTTACCAG AAGGAGTAGA AATGGTAATG CCAGGAGACC 7$£ 
ATATAGACAT GAACGTAGAA TTAATCACAC CAGTAGCAA 789 



2) INFORMATION FOR SEQ ID NO: 722 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722 

GTTCAGCAGC AGACGGTCCA ATGCCACAAA CAAGAGAACA TATACTACTA 50 

GCATCAAGAG TTGGTGTTGA CTATATCGTA GTATTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTAGAG 150 

25 AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATCAAG 20 0 

GGATCAGCTT TAGTAGCATT AGAAAACCCA ACAGATGAAA AATCAATCGC 250 

TCCAATCTTA GAATTAATGG AAGGAGTAGA TAGCTACATT CCAACACCAG 300 

AAAGAGCAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 350 

ATAACTGGTA GAGGAACAGT TGCAACAGGA AGAGTTGAAA GAGGAGTTCT 4 00 

30 TCATGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA GAAAGCAGAA 450 

AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAGCA 500 

CAAGCTGGAG ATAATGTTGG AGTACTTTTA AGAGGTGTTC AAAGAACAGA 550 

TATCGAAAGA GGTCAAGTAT TAGCAAAGAC TGGATCAGTT AAGCCACACA 600 

GCAAGTTCGT AGGTCAAGTA TACGTACTTA AGAAAGAAGA AGGTGGAAGA 650 

35 CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 700 

AGACGTTACT GGATCAATCA AATTACCAGA CGGAATGGAA ATGGTTATGC 750 

CAGGAGACCA CATTGATATG AACGTTGAAT TAATCACTCA AGTAGCAA 798 



40 



50 



2) INFORMATION FOR SEQ ID NO: 723 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 799 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

55 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723 

GTTCAGCAGC AGATGGTC CA ATGCCTCAAA CAAGAGAACA CATACTACTA 50 

GCTTCAAGAG TTGGTGTTGA CTACATAGTT GTTTTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTTGAAATG GAAGTAAGAG 150 

60 AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATAAAG 200 

412 
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GGTTCAGCTT TACAAGCATT 
TCCAATCCTT GAGTTAATGG 
AAAGAGCAAC AGATAAGCCA 
ATCACTGGTA GAGGAACAGT 
5 TCACGTAGGA GACGAAGTAG 
AAGTAGTAGT AACAGGAATA 
CAAGCTGGAG ACAACGTAGG 
CATCGAAAGA GGTCAAGTTT 
AGAAATTTGT AGGTCAAGTA 
10 CATACTCCAT TCTTCGATGG 
AGAYGTTACT GGTTCAATCA 
CAGGAGACCA CATTGATATG 



AGAAAACCCA ACAGATGAAA 
AAGCTGTAGA TAGCTACATT 
TTCTTAATGC CAGTAGAAGA 
TGCTACAGGA AGAGTTGAAA 
AAATCGTTGG ATTATCAGAA 
GAAATGTTCA GAAAGTTACT 
AGTTCTTTTA AGAGGAGTTC 
TAGCAAAAGT TGGATCAGTT 
TACGTACTTA AAAAAGAAGA 
ATACAGACCA CAATTCTACT 
AGTTACCAGA TGGAATGGAA 
AACGTTGAAT TAATCACTCA 
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AAGCAATCGC 250 

CCAACTCCAG 300 

TGTATTCACA 350 

GAGGAGTTCT 400 

GACAGCAGAA 450 

AGACGAAGCG 500 

AAAGAACTGA 550 

AAGCCACACA 600 

AGGTGGAAGA 650 

TCAGAACAAC 700 

ATGGTTATGC 750 

AGTAGCTAT 799 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 724 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 801 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724 

TAGTAAGTGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATACTA 50 

TTAGCATCCA GAGTTGGAGT TGAGCACATA GTAGTATTCT TAAATAAAGC 100 

AGACCAAGTA GATGACGCAG AGTTAATCGA ATTAGTAGAA ATGGAAGTAA 150 

35 GGGAATTAAT GAACGAATAC GGATTCCCAG GAGATGACGC ACCAGTAGTA 200 

GTAGGATCCG CATTAAAAGC ATTAGAAAAT CCAGAAGATG ATGCAGCAAC 250 

ACAATGCATA ATGGACTTAA TGGCAGCAGT AGATGAATAT ATACCAACAC 300 

CAGAAAGAGC AACAGATAAG CCATTCTTAA TGCCAGTAGA AGATATCTTC 350 

ACAATCACAG GAAGAGGAAC AGTTGCAACA GGAAGAGTAG AAAGAGGAAT 400 

40 TCTAAAAGTA GGAGACGAAA TAGAAATCGT AGGATTAAGT GATGAAAGCA 450 

AGAAATCAGT AATCACAGGA ATAGAAATGT TCAGAAAACT ATTAGATGAA 500 

GCACAAGCAG GAGATAACAT CGGAGCATTA TTAAGAGGTG TTCAAAGAGA 550 

TGAAATCCAA AGAGGTCAAG TATTAGCAGC AACAGGATCA GTAAAACCAC 600 

ATAAGAGTTT TACAGGTCAA GTATATGTAT TAAAGAAAGA AGAAGGAGGA 650 

45 AGACACACTC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTTAGAAC 700 

AACAGACGTA ACAGGTTCAA TCGCACTACC AGAAGGAGTA GAAATGGTAA 750 

TGCCAGGAGA CCACATAGAC ATGAAGGTAG AATTAATAAC AAGAGTAGCA 800 

A 801 



2) INFORMATION FOR SEQ ID NO: 725 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 633 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

413 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725 

TATGCCTCAA ACTCGTGAAC ACATCTTGTT ATCTCGTAAC GTTGGTGTTC 50 

CTTACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGATGAAGAA 100 

10 TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATACGA 150 

CTTCCCAGGC GACGACACTC CAGTTATCGC TGGTTCAGCT TTGAAAGCTT 200 

TAGAAGGCGA TGCTTCATAC GAAGAAAAAA TCTTAGAATT AATGGCTGCT 250 

GTTGATGAAT ATATCCCAAC ACCAGTTCGT GATACTGACA AACCATTCAT 3 00 

GATGCCAGTC GAAGATGTAT TCTCAATCAC TGGTCGTGGA ACTGTTGCAA 350 

15 CTGGTCGTGT TGAACGTGGA CAAGTT CGCG TTGGTGACGA AGTTGAAATC 400 

GTTGGTATTG CTGAAGCAAC TGCTAAAACA ACTGTTACAG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGATT ACGCTGAAGC AGGCGATAAC ATTGGTGCAT 500 

TGTTACGTGG TGTTGCACGT GAAGACATCC AACGTGGACA AGTATTGGCT 550 

AAACCAGCTT CAATCACTCC ACATACAAAA TTCTCTGCAG AAGTTTACGT 60 0 

2 0 TTTAACTAAA GAAGAAGGCG GACGTCATAC TCC 633 



25 



30 



2) INFORMATION FOR SEQ ID NO: 72 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 49903 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 6 



40 CACGTGAACA 
GTATTCTTAA 
AGTAGAAATG 
ATGACACTCC 
GCTTCTTACG 

45 CATCCCAACT 
AGGATGTATT 
GAACGTGGAC 
TGACGAAACT 
TATTAGACTA 

50 GTTGCTCGTG 
AATCACTCCA 
AAGAAGGTGG 



CATCTTGTTA 
ACAAAATGGA 
GAAGTTCGTG 
AGTTGTTGCA 
AAGAAAAAAT 
CCAACTCGTG 
CTCAATCACT 
AAGTTCGCGT 
GCTCAAACAA 
CGCTGAAGCA 
AAGACATCCA 
CATACAAAAT 
ACGTCATACT 



TCTCGTAACG 
TATGGTTGAT 
ACTTATTATC 
GGTTCTGCTT 
CATGGAATTA 
ACACTGACAA 
GGACGTGGTA 
TGGTGACGTT 
CTGTAACAGG 
GGCGATAACA 
ACGTGGACAA 
TCTCTGCTGA 
CCA 



TAGGTGTTCC 
GACGAAGAAT 
AGAATACGAT 
TGAAAGCTTT 
ATGGCTGCAG 
ACCATTCATG 
CTGTTGCTAC 
GTAGATATCG 
TGTTGAAATG 
TCGGTGCTTT 
GTTTTAGCTA 
AGTATACGTA 



TTACATCGTT 5 0 

TATTAGAATT 10 0 

TTCCCAGGCG 150 

AGAAGGCGAC 20 0 

TTGACGAGTA 250 

ATGCCAGTTG 30 0 

AGGTCGTGTT 350 

TTGGTATCGC 400 

TTCCGTAAAT 450 

ATTACGTGGT 500 

AACCAGCTTC 550 

TTAAGCAAAG 600 

623 



55 

2) INFORMATION FOR SEQ ID NO: 727 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 646 bases 
60 (B) TYPE: Nucleic acid 

414 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DMA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727 

CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 10 0 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

15 ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 2 00 

AAGCTCTTGA AGG AG AT C C T TCATACGAAG AAAAAATCAT GGAATTGATG 2 50 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 3 00 

ATTCATGATG C C AGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 4 00 

2 0 GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 4 50 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA TTCTTC 646 



25 



40 



60 



2) INFORMATION FOR SEQ ID NO: 728 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma, pi rum 

(B) STRAIN: ATCC 25960D 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728 



GAGCAATTTT AGTTGTTTCT GCAACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACATATCT TATTAGCACG CCAAGTTGGT GTTCCTAAAA TGGTTGTTTT 100 

4 5 CTTAAACAAA TGTGATGTTG CTTCTGATCC AGAAATGCAA GAATTAGTTG 150 

CTGAAGAAGT AAAAGACTTA TTAAAATCTT ATGGTTTTGA TGGTGACAAT 200 

ACTCCAATTA TTCGTGGTTC AGCATTAGAA GCATTAAATG GTAAACCTGA 250 

ATGAGAAGAA AAAATTAAAG AATTAATGAA GGCAGTGGAT GACACTATTC 300 

CTGATC CAGT TCGTGATACT GAAAAGCCAT TCTTGTTACC AATTGAAGAC 3 50 

50 GTAATGACAA TTACAGGTCG TGGTACTGTT GTTACAGGTC GTGTAGAACG 4 00 

TGGTACTCTA AAATTAAATG ATGAAGTTGA AATTGTTGGT TTAGGTGAAA 450 

CATTTAAATC TGTTGTAACA GGTATTGAAA TGTTCCGTAA AGAATTAGAT 50 0 

GAAGCTCGTG CTGGTGACAA TGCTGGTATT TTACTTCGTG GTGTTGACCG 550 

TGGTCAAGTA CAACGTGGTC AAGTTCTTGC TAAACCAGGT TCTATTACTC 600 

55 CTCATACTAA ATTTAAAGCT GAAATTTATG CTTTGAAAAA AGAAGAAGGT 650 

GGTCGTCATA CTGCTTTCTT AAACGGTTAT CGTCCTCAAT TCTATTTCAG 700 

AACAACTGAT GTTACAGGTT CTATTAAATT AAAAGATGGA ACTGAAATGG 750 

TTATGCCTGG TGACAATACT GAAATCACTG TAGAATTAAT TTCACCAATT 80 0 

GCTTGTAAAA AGGAAGTAAG TTT 823 
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2) INFORMATION FOR SEQ ID NO: 72 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma salivarium 
15 (B) STRAIN: ATCC 23064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729 

GGAGCAATCT TAGTTGTTGC TGCAACTGAT GGTGCGATGC CTCAAACTCG 50 

2 0 TGAACACGTT TTACTTGCAA AACAAGTTGG TGTTCCTAAA ATCGTTGTTT 10 0 

TCTTAAACAA AATCGATATG TTCAAGCCAG AAGAAAGAGC CGAAATGGTT 15 0 

GAAATGGTTG AAATGGACAT TCGTGACTTA TTAAACAAAT ACGACTTTGA 200 

TGGTGACAAT ACTCCTATTA TTGCTGGTTC AGCATTAAAA GCATTACAAG 250 

GTGATCCAGA ATATGAAAAG AATATTCTAG AACTTATGGA TGCAGTAGAC 300 

25 ACATACATTG ATGAACCTAC TCGTGATAAA GATAAACCAT TCTTAATGGC 3 50 

TGTAGAAGAC GTATTTACAA TTACTGGTCG TGGAACTGTT GCTACTGGTA 4 00 

GAGTAGAACG TGGTACATTA CATCTAAATG ATGAAGTTGA AATCGTTGGT 450 

CTACGTCCAA CAATTAAAAC TGTTGTTACT GGAATTGAAA TGTTCCGTAA 500 

AAATTTAAAA GAAGCTCAAG CTGGAGATAA TGCAGGACTT TTACTACGTG 550 

30 GAATTGATAG AGATCAAGTA GAACGTGGAC AAGTTTTGGC CAAACCAAAA 600 

AGTATTATTC CTCACACAGA ATTTGAAGCT GCTGTGTATG TTCTAAAAGC 650 

TGAAGAAGGT GGACGTCACA CTCCATTCTT TGAACACTAT AAACCACAAT 700 

TTTACTTTCG TACAACCGAC GTTACTGGTG GAATTAAATT CAAACCTGGA 750 

CGTGAAATGG TTATGCCTGG CGAAAATGTT GAATTTACAG TTACTTTAAT 800 

35 TGCTCCTATT GCAGTTGAAG AAGGAA 82 6 



2) INFORMATION FOR SEQ ID NO: 730 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 437 68 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 0 

55 TGGTATGTTC CGCAGCYGAY GGYCCTATGC CTCAAACTCG CGAACACATC 50 

CTGYTGGCTC GCCAAGTAGG YGTACCTTAC ATCATCGTRT TCATGAACAA 10 0 

ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT GAAATGGAAA 150 

TCCGYGACCT GYTGTCMAGC TACGACTTCC CMGGCGACGA CTGCCCAATC 200 

GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGYGAYGCTG GTTACGAAGA 250 

60 GAAAATCTTC GAAYTGGCTG CTGCTTTGGA CAGCTACATC CCAACTCCTG 300 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AGCGTGCTGT GGACAAACCK 
ATCTCTGGYC GYGGTACMGT 
CCACGTTGGT GACGAGATYG 
CCACTTGTAC CGGCGTTGAA 
5 GCWGGTGACA ACGTAGGCGT 
AGAGCGTGGT CAAGTATTGG 
AATTCAAAGC AGAAGTATAC 
ACTCCATTCT TCGCYAACTA 
CGTAACYGGT GCAGTTACTT 
10 GTGAGAACGT AACCATTACY 
AAGGTCTGCG 



TTCYTGYTGC CTATCGAAGA 
AGTAACYGGY CGTGTAGAGC 
AAATCGTAGG TCTGAAAGAA 
ATGTTCCGCA AACTGCTGGA 
ATTGYTGCGT GGTACCAARC 
CYAAACCAGG YACCATYACT 
GTACTGAGCA AAGAAGAGGG 
CCGYCCMCAA TTCTACTTCC 
TGGAAGAAGG TGTAGAAATG 
GTAGAACTGA TTGCGCCTAT 
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CGTRTTCTCT 


350 


GCGGTRTCAT 


400 


ACCCAAAAAA 


450 


CGAAGGTCAA 


500 


GTGAAGACGT 


550 


CCSCACACCA 


600 


TGGTCGYCAC 


650 


GTACYACTGA 


700 


GTAATGCCAG 


750 


CGCTAGGAAG 


800 




810 



15 2) INFORMATION FOR SEQ ID NO: 731 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



25 



30 



35 



40 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Enteritidis 

(B) STRAIN: ATCC 13076 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731 



CTGGTTGTTG 
CCTGCTGGGT 
AATGCGACAT 
GTTCGYGAAC 
CGTTCGTGGT 
CGAAAATCAT 
GAGCGTGCGA 
CATCTCCGGT 
TCAAAGTGGG 
TCTACCTGTA 
TGCCGGTGAG 
TCGAACGTGG 
AAGTTCGAAT 
CACTCCGTTC 
ACGTGACTGG 
GGCGACAACA 
CGACGGTCTG 



CTGCGACTGA 
CGTCAGGTAG 
GGTTGATGAC 
TGCTGTCTCA 
TCTGCTCTGA 
CGAACTGGCT 
TTGACAAGCC 
CGTGGTACCG 
CGAAGAAGTT 
CTGGCGTTGA 
AACGTAGGTG 
TCAGGTACTG 
CTGAAGTGTA 
TTCAAAGGCT 
CACCATCGAA 
TCAAAATGGT 
CGT 



CGGCCCGATG 
GCGTTCCGTA 
GAAGAGCTGC 
GTACGACTTC 
AAGCGCTGGA 
GGCTTCCTGG 
GTTCCTGCTG 
TTGTTACCGG 
GAAATCGTTG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAGCCGG 
CATTCTGTCC 
ACCGTCCGCA 
CTGCCGGAAG 
TGTTACCCTG 



CCGCAGACCC 
CATCATCGTG 
TGGAACTGGT 
CCGGGCGACG 
AGGCGACGCA 
ATTCTTACAT 
CCGATCGAAG 
TCGTGTAGAA 
GTATCAAAGA 
AAACTGCTGG 
TGGTATCAAA 
GCACCATCAA 
AAAGATGAAG 
GTTCTACTTC 
GCGTAGAGAT 
ATCCACCCGA 



GTGAGCACAT 50 

TTCCTGAACA 100 

TGAAATGGAA 150 

ACACTCCGAT 200 

GAGTGGGAAG 250 

CCCGGAA.CCA 300 

ACGTATTCTC 350 

CGCGGTATCA 400 

GACTCAGAAG 450 

ACGAAGGCCG 500 

CGTGAAGAAA 550 

GCCGCACACC 600 

GCGGCCGTCA 650 

CGTACTACTG 700 
GGTAATGCCG . 750 

TCGCAATGGA 800 

813 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 732 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: 



Genomic DNA 
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